Models & Providers
AI models come in different types, each designed for specific kinds of tasks. Understanding these types helps you choose the right tool for your needs.
Text models process and generate written language. They're the most common type of AI model you'll encounter.
[What they do]:
[Examples]: GPT-4, Claude, Gemini Pro
[Use cases]: Chatbots, writing assistants, code generators, content creation tools
Image models work with pictures instead of text. They can analyze images or create new ones.
[What they do]:
[Examples]: DALL-E, Midjourney, Stable Diffusion
[Use cases]: Image generation, photo editing, visual search, content moderation
Multimodal models can work with multiple types of data—text, images, audio, and sometimes video.
[What they do]:
[Examples]: GPT-4 Vision, Claude 3, Gemini Ultra
[Use cases]: Visual Q&A, image analysis tools, content creation platforms
[Need to process text?] → Use a text model
[Need to work with images?] → Use an image model or multimodal model
[Need both?] → Use a multimodal model
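The same decision guide can be written as a small function, which may help if you want to encode the choice in a script. This is only an illustrative sketch: the function name `choose_model_type` and its two boolean parameters are hypothetical, and the returned strings simply repeat the guide's recommendations rather than naming any real provider's API.

```python
def choose_model_type(needs_text: bool, needs_images: bool) -> str:
    """Return the model type the guide suggests for the given needs.

    Hypothetical helper for illustration; it mirrors the three rules
    in the guide and does not call any real model or provider.
    """
    if needs_text and needs_images:
        # Both kinds of data: the guide points to a multimodal model.
        return "multimodal model"
    if needs_images:
        # Images only: either an image model or a multimodal model works.
        return "image model or multimodal model"
    if needs_text:
        # Text only: a text model is enough.
        return "text model"
    # The guide does not cover the case where neither is needed.
    raise ValueError("specify at least one kind of data to process")


if __name__ == "__main__":
    print(choose_model_type(needs_text=True, needs_images=False))
    print(choose_model_type(needs_text=True, needs_images=True))
```

Checking the combined case first matters here: if the images-only rule ran before it, a request that needs both text and images would get the looser "image model or multimodal model" answer instead of the guide's specific recommendation.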
Models also come in different sizes:
Many providers offer multiple sizes of the same model, letting you choose based on your needs.
Some models are specialized for specific tasks:
Understanding model types helps you make better decisions about which AI tools to use.