A practical reference to the major frontier and open-source models: who makes them, what they're good at, and when to use them.
Claude is a family of models from Anthropic focused on helpfulness, honesty, and safety. It's known for strong reasoning, long context windows, and careful tool use. The current lineup is organized into three tiers: Opus (most capable), Sonnet (balanced), and Haiku (fast/cheap). Good default for coding agents, document analysis, and long-form reasoning.
OpenAI's GPT family (GPT-4, GPT-4o, and the o-series reasoning models) pioneered modern LLMs. Strong general-purpose performance, rich tooling (function calling, structured outputs, Assistants API), and broad ecosystem support. Reasoning-focused variants trade latency for stronger multi-step thinking.
Gemini is Google's multimodal family (text, image, audio, video, code) with very long context windows. Tight integration with Google Cloud and Workspace, and competitive on multimodal benchmarks. Useful when you need native multimodal reasoning or deep GCP integration.
Meta's Llama is the most widely used open-weights family. Strong performance, a community license that allows most commercial use but carries some restrictions, and a massive ecosystem of fine-tunes. A good foundation when you need to run locally, fine-tune aggressively, or avoid sending data to third-party APIs.
Mistral ships both open-weights models (Mistral 7B, Mixtral MoE) and hosted frontier models (Mistral Large). Known for efficient architectures (mixture-of-experts in particular) and strong performance per dollar.
xAI's Grok family is integrated with the X platform and positioned around real-time information and less restrictive outputs. Competitive on reasoning benchmarks in its top tiers.
DeepSeek (DeepSeek-V3, R1 reasoning) and Alibaba's Qwen are leading open-weights families, often matching or exceeding Western open models on key benchmarks. Worth evaluating any time you're choosing an open baseline.
| Family | Provider | Access | Strengths | Typical use |
|---|---|---|---|---|
| Claude | Anthropic | API | Reasoning, long context, coding, tool use | Agents, doc analysis, coding assistants |
| GPT / o-series | OpenAI | API | General purpose, ecosystem, multimodal, reasoning | Broad product features, structured outputs |
| Gemini | Google | API / GCP | Native multimodal, very long context | Video/audio, GCP-native apps |
| Llama | Meta | Open weights | Self-hosting, fine-tunes, ecosystem | On-prem, private data, custom models |
| Mistral / Mixtral | Mistral | Open + API | Efficient MoE, cost/perf | High-volume, latency-sensitive apps |
| Grok | xAI | API / X | Real-time data, reasoning | X-integrated and news-heavy use cases |
| DeepSeek / Qwen | DeepSeek / Alibaba | Open weights | Strong open baselines, reasoning variants | Open-source alternatives to frontier APIs |
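
If you want to shortlist families programmatically, the table above can be encoded as plain data and filtered by column. The following is only a minimal sketch: the `ModelFamily` class and the two helper functions are hypothetical names introduced for illustration, the values are copied from the table, and the keyword matching is a naive substring check rather than a real selection method.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelFamily:
    """One row of the comparison table (hypothetical helper type)."""
    name: str
    provider: str
    access: str
    strengths: str
    typical_use: str


# Values copied from the table; they will go stale as providers update lineups.
FAMILIES = [
    ModelFamily("Claude", "Anthropic", "API",
                "Reasoning, long context, coding, tool use",
                "Agents, doc analysis, coding assistants"),
    ModelFamily("GPT / o-series", "OpenAI", "API",
                "General purpose, ecosystem, multimodal, reasoning",
                "Broad product features, structured outputs"),
    ModelFamily("Gemini", "Google", "API / GCP",
                "Native multimodal, very long context",
                "Video/audio, GCP-native apps"),
    ModelFamily("Llama", "Meta", "Open weights",
                "Self-hosting, fine-tunes, ecosystem",
                "On-prem, private data, custom models"),
    ModelFamily("Mistral / Mixtral", "Mistral", "Open + API",
                "Efficient MoE, cost/perf",
                "High-volume, latency-sensitive apps"),
    ModelFamily("Grok", "xAI", "API / X",
                "Real-time data, reasoning",
                "X-integrated and news-heavy use cases"),
    ModelFamily("DeepSeek / Qwen", "DeepSeek / Alibaba", "Open weights",
                "Strong open baselines, reasoning variants",
                "Open-source alternatives to frontier APIs"),
]


def families_with_access(keyword: str) -> list[str]:
    """Names of families whose Access column contains the keyword (case-insensitive)."""
    kw = keyword.lower()
    return [f.name for f in FAMILIES if kw in f.access.lower()]


def families_with_strength(keyword: str) -> list[str]:
    """Names of families whose Strengths column contains the keyword (case-insensitive)."""
    kw = keyword.lower()
    return [f.name for f in FAMILIES if kw in f.strengths.lower()]


if __name__ == "__main__":
    print("Open weights available:", families_with_access("open"))
    print("Multimodal listed as a strength:", families_with_strength("multimodal"))
```

A filter like this only narrows the list to candidates worth evaluating; it says nothing about current pricing, context limits, or benchmark results.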
Model capabilities, pricing, and context windows change frequently. Treat this page as a map, not a spec sheet - check each provider's docs for current numbers.