An AI model that supports two or more forms of media; for example, text and images. For example, various versions of GPT and Gemini are trained on text and images. See
GPT and
multimodal.
Multimodal vs. Multimodel AI
Multimodal (multi
modal) AI differs from multimodel (multi
mahdel) AI. Whereas multimodal supports different kinds of media, multimodel works with multiple language models.