Aussie AI
Open Source Models
-
Book Excerpt from "Generative AI in C++"
-
by David Spuler, Ph.D.
Open Source Models
There are numerous pre-trained LLMs available for free download under open source licenses. The best known such foundation model is Meta's Llama series of models, but there are many others with quite extensive capabilities. The main advantage of these models is obvious: you can avoid the expense of training your own model.
Typically, model files are uploaded to a repository website such as GitHub or Hugging Face. These are models whose weights have been trained using a variety of different data sets. In some cases, the models come with an engine platform, but in orders you will need to use a standard engine.
Some models are licensed for research-only or other non-commercial purposes. Several model files have permissive licenses that allow any usage, including commercial purposes. For example, Meta's Llama model was first licensed for research-only, but they subsequently released Llama2 under a more permissive license.
There are also derivative models available for download, which are based on modifications made to the larger models. The most common are quantized models, where an original full-precision model with 32-bit float weights has been “quantized” down to smaller data types (e.g. 16-bit or 8-bit integers). However, there are other types of derivatives, such as smaller models trained on the outputs of larger models.
• Next: • Up: Table of Contents |
The new AI programming book by Aussie AI co-founders:
Get your copy from Amazon: Generative AI in C++ |