Term of the Moment

EFS


Look Up Another Term


Definition: AI model


An AI model is the foundation from which ChatGPT, Gemini and other chatbots are created. Models are pattern recognition systems that are fine-tuned to answer questions and/or generate original content. The architecture of a model is a matrix modeled after the human neural system (see neural network).

Language models deal with text while image models handle drawings and photos. Multimodal models support some combination of text, images, audio and video. See large language model, image model and AI secret sauce.

The AI programmer designs the neural network model with interconnections between each node called "parameters." These parameters start out with preset values and are modified in the training stages to generate better results. Large language models (LLMs) can have billions of parameters and small language models (SLMs) have hundreds of thousands or millions. See AI training vs. inference.