Model Capabilities
What copair knows about each model out of the box.
Copair derives most per-model behaviour from generic logic (tier classifier, family-prefix format selection, tier-driven harness defaults). For values that genuinely vary per family — context window, output token limit, native-tool-calling reliability — copair ships the values below.
Honest caveat: values are conservative when uncertain. If copair ships e.g. 16k output for a model that actually supports more, override via model_overrides in your config. See config docs.
Run copair --explain-model <id> for the full resolution trace on any model ID — useful for debugging "why did copair pick X for my model?"
Frontier cloud (closed-source)
Hosted vendor APIs with reliable native tool calling. Copair uses the provider SDK directly rather than text-extraction formatters.
Anthropic Claude Opus (modern)
/^claude-opus/ctx 200kout 32ktool reliable
Anthropic Claude Opus (modern)
/^claude-opus/^claude-opusAnthropic Claude Sonnet / Haiku (modern)
/^claude-(?:sonnet|haiku)/ctx 200kout 64ktool reliable
Anthropic Claude Sonnet / Haiku (modern)
/^claude-(?:sonnet|haiku)/^claude-(?:sonnet|haiku)Anthropic Claude 3.x
/^claude-3/ctx 200kout 8.2ktool reliable
Anthropic Claude 3.x
/^claude-3/^claude-3OpenAI GPT-5 Mini
/^gpt-5-mini/ctx 400kout 4.1ktool reliable
OpenAI GPT-5 Mini
/^gpt-5-mini/^gpt-5-miniOpenAI GPT-5
/^gpt-5/ctx 400kout 16.4ktool reliable
OpenAI GPT-5
/^gpt-5/^gpt-5OpenAI GPT-4o
/^gpt-4o/ctx 128kout 16.4ktool reliable
OpenAI GPT-4o
/^gpt-4o/^gpt-4oOpenAI GPT-4
/^gpt-4/ctx 128kout 8.2ktool reliable
OpenAI GPT-4
/^gpt-4/^gpt-4OpenAI o-series
/^o[134](?:-mini|-pro)?\b/ctx 200kout 32ktool reliable
OpenAI o-series
/^o[134](?:-mini|-pro)?\b/^o[134](?:-mini|-pro)?\bGoogle Gemini 2.5+
/^gemini-(?:2-5|3)/ctx 1Mout 65.5ktool reliable
Google Gemini 2.5+
/^gemini-(?:2-5|3)/^gemini-(?:2-5|3)Google Gemini 2.x
/^gemini-2/ctx 1Mout 8.2ktool reliable
Google Gemini 2.x
/^gemini-2/^gemini-2xAI Grok 4
/^grok-4/ctx 256kout 16.4ktool reliable
xAI Grok 4
/^grok-4/^grok-4xAI Grok 1-3
/^grok-[1-3]/ctx 131.1kout 8.2ktool reliable
xAI Grok 1-3
/^grok-[1-3]/^grok-[1-3]Moonshot Kimi K2
/^kimi-k2/ctx 200kout 16.4k
Moonshot Kimi K2
/^kimi-k2/^kimi-k2MiniMax M1
/^minimax-m1/ctx 1Mout 16.4k
MiniMax M1
/^minimax-m1/^minimax-m1MiniMax M2+
/^minimax-m[2-9]/ctx 200kout 16.4k
MiniMax M2+
/^minimax-m[2-9]/^minimax-m[2-9]gpt-oss (open-weight)
/^gpt-?oss-?(?:20|120)b/ctx 131.1kout 8.2k
gpt-oss (open-weight)
/^gpt-?oss-?(?:20|120)b/^gpt-?oss-?(?:20|120)bFrontier open-weight (large)
Large open-weight models (typically 30B+ parameters). Tool calling tends to be unreliable; copair uses text-extraction with qwen-xml or dsml format depending on family.
Qwen3-Coder 480B
/^qwen3-coder-480b/ctx 262.1kout 32.8k
Qwen3-Coder 480B
/^qwen3-coder-480b/^qwen3-coder-480bQwen3 235B
/^qwen3-(?:vl-)?235b/ctx 262.1kout 16.4k
Qwen3 235B
/^qwen3-(?:vl-)?235b/^qwen3-(?:vl-)?235bQwen3-Next 80B
/^qwen3-next-80b/ctx 262.1kout 16.4k
Qwen3-Next 80B
/^qwen3-next-80b/^qwen3-next-80bAlibaba Qwen-Plus / Qwen-Turbo (1M context, Qwen3+ defaults)
/^qwen-(?:plus|turbo)/ctx 1Mout 8.2k
Alibaba Qwen-Plus / Qwen-Turbo (1M context, Qwen3+ defaults)
/^qwen-(?:plus|turbo)/^qwen-(?:plus|turbo)Alibaba Qwen3-Max
/^qwen3-max/ctx 262.1kout 32.8k
Alibaba Qwen3-Max
/^qwen3-max/^qwen3-maxMistral / Pixtral Large
/^(?:mistral|pixtral)-large/ctx 131.1kout 8.2k
Mistral / Pixtral Large
/^(?:mistral|pixtral)-large/^(?:mistral|pixtral)-largeSmall open-weight (≤ ~14B)
Small open-weight models. The small-model harness auto-engages: max-turn cap, ask_user / task_complete tools, per-turn format reminders.
Qwen3-Coder 30B
/^qwen3-coder-30b/ctx 262.1kout 16.4k
Qwen3-Coder 30B
/^qwen3-coder-30b/^qwen3-coder-30bQwen3 32B/30B
/^qwen3-(?:vl-)?(?:30b-a3b|32b)/ctx 131.1kout 8.2k
Qwen3 32B/30B
/^qwen3-(?:vl-)?(?:30b-a3b|32b)/^qwen3-(?:vl-)?(?:30b-a3b|32b)Qwen2/2.5 32B/72B
/^qwen2(?:-5)?-(?:coder-)?(?:32b|72b)/ctx 131.1kout 8.2k
Qwen2/2.5 32B/72B
/^qwen2(?:-5)?-(?:coder-)?(?:32b|72b)/^qwen2(?:-5)?-(?:coder-)?(?:32b|72b)Qwen3 small (0.6B–14B)
/^qwen3-(?:vl-)?(?:0-6|1-7|4|8|14)b/ctx 131.1kout 8.2k
Qwen3 small (0.6B–14B)
/^qwen3-(?:vl-)?(?:0-6|1-7|4|8|14)b/^qwen3-(?:vl-)?(?:0-6|1-7|4|8|14)bQwen2/2.5 7B/14B
/^qwen2(?:-5)?-(?:coder-)?(?:7|14)b/ctx 131.1kout 8.2k
Qwen2/2.5 7B/14B
/^qwen2(?:-5)?-(?:coder-)?(?:7|14)b/^qwen2(?:-5)?-(?:coder-)?(?:7|14)bDeepSeek frontier (V3.x / R1)
/^deepseek-(?:v[34]|r[12])(?!.*-distill)/ctx 131.1kout 8.2k
DeepSeek frontier (V3.x / R1)
/^deepseek-(?:v[34]|r[12])(?!.*-distill)/^deepseek-(?:v[34]|r[12])(?!.*-distill)DeepSeek API alias
/^deepseek-(?:chat|reasoner)/ctx 131.1kout 8.2k
DeepSeek API alias
/^deepseek-(?:chat|reasoner)/^deepseek-(?:chat|reasoner)DeepSeek R1 distill ≤8B
/^deepseek-r1.*?-(?:1-5|7|8)b/ctx 131.1kout 8.2k
DeepSeek R1 distill ≤8B
/^deepseek-r1.*?-(?:1-5|7|8)b/^deepseek-r1.*?-(?:1-5|7|8)bDeepSeek R1 distill ≥14B
/^deepseek-r1.*?-(?:14|32|70)b/ctx 131.1kout 8.2k
DeepSeek R1 distill ≥14B
/^deepseek-r1.*?-(?:14|32|70)b/^deepseek-r1.*?-(?:14|32|70)bDeepSeek Coder 1.3B
/^deepseek-coder-1-3b/ctx 16.4k
DeepSeek Coder 1.3B
/^deepseek-coder-1-3b/^deepseek-coder-1-3bLlama 4 Scout (10M context — longest open-weight as of 2025)
/^llama-?4-scout/ctx 10Mout 8.2k
Llama 4 Scout (10M context — longest open-weight as of 2025)
/^llama-?4-scout/^llama-?4-scoutLlama 4 Maverick
/^llama-?4-maverick/ctx 1Mout 8.2k
Llama 4 Maverick
/^llama-?4-maverick/^llama-?4-maverickLlama 3.x large (70B+)
/^llama-?[34](?:-\d+)*-(?:70b|72b|90b|405b)/ctx 131.1kout 8.2k
Llama 3.x large (70B+)
/^llama-?[34](?:-\d+)*-(?:70b|72b|90b|405b)/^llama-?[34](?:-\d+)*-(?:70b|72b|90b|405b)Llama 3.x small
/^llama-?[34](?:-\d+)*-(?:1b|3b|7b|8b|11b)/ctx 131.1kout 8.2k
Llama 3.x small
/^llama-?[34](?:-\d+)*-(?:1b|3b|7b|8b|11b)/^llama-?[34](?:-\d+)*-(?:1b|3b|7b|8b|11b)Mistral Codestral
/^codestral/ctx 256kout 16.4k
Mistral Codestral
/^codestral/^codestralMistral Magistral (reasoning model)
/^magistral-(?:medium|small)/ctx 131.1kout 16.4k
Mistral Magistral (reasoning model)
/^magistral-(?:medium|small)/^magistral-(?:medium|small)Mistral Pixtral 12B (vision)
/^pixtral-12b/ctx 128kout 8.2k
Mistral Pixtral 12B (vision)
/^pixtral-12b/^pixtral-12bMistral Medium / Small 3+
/^mistral-(?:medium|small-[34])/ctx 131.1kout 8.2k
Mistral Medium / Small 3+
/^mistral-(?:medium|small-[34])/^mistral-(?:medium|small-[34])Mistral Nemo 12B
/^mistral-nemo/ctx 131.1kout 8.2k
Mistral Nemo 12B
/^mistral-nemo/^mistral-nemoMixtral 8x22B
/^mixtral-8x22b/ctx 65.5kout 8.2k
Mixtral 8x22B
/^mixtral-8x22b/^mixtral-8x22bMixtral 8x7B
/^mixtral-8x7b/ctx 32.8k
Mixtral 8x7B
/^mixtral-8x7b/^mixtral-8x7bMinistral
/^ministral-(?:3|7|14)b/ctx 131.1kout 8.2k
Ministral
/^ministral-(?:3|7|14)b/^ministral-(?:3|7|14)bGLM 4.5+
/^glm-(?:[5-9]|4-[5-9])/ctx 131.1kout 8.2k
GLM 4.5+
/^glm-(?:[5-9]|4-[5-9])/^glm-(?:[5-9]|4-[5-9])GLM-4 9B
/^glm-4-9b/ctx 131.1kout 8.2k
GLM-4 9B
/^glm-4-9b/^glm-4-9bPhi-4 small
/^phi-?4-(?:mini|multimodal)/ctx 131.1k
Phi-4 small
/^phi-?4-(?:mini|multimodal)/^phi-?4-(?:mini|multimodal)Phi-4 14B
/^phi-?4(?:-14b)?\b/ctx 131.1kout 8.2k
Phi-4 14B
/^phi-?4(?:-14b)?\b/^phi-?4(?:-14b)?\bPhi-3 small
/^phi-?3(?:-5)?-(?:mini|small)/ctx 131.1k
Phi-3 small
/^phi-?3(?:-5)?-(?:mini|small)/^phi-?3(?:-5)?-(?:mini|small)Gemma 3
/^gemma-?[3]-?(?:1b|2b|4b|9b|12b|27b)/ctx 131.1kout 8.2k
Gemma 3
/^gemma-?[3]-?(?:1b|2b|4b|9b|12b|27b)/^gemma-?[3]-?(?:1b|2b|4b|9b|12b|27b)Gemma 2 large (8k context — predates extended-context release)
/^gemma-?2-?(?:9b|27b)/ctx 8.2kout 8.2k
Gemma 2 large (8k context — predates extended-context release)
/^gemma-?2-?(?:9b|27b)/^gemma-?2-?(?:9b|27b)Gemma 2 small (8k context)
/^gemma-?2-?(?:2b|4b)/ctx 8.2k
Gemma 2 small (8k context)
/^gemma-?2-?(?:2b|4b)/^gemma-?2-?(?:2b|4b)Cohere Command R+ / R7B / A
/^command-(?:a|r-plus|r7b)/ctx 128k
Cohere Command R+ / R7B / A
/^command-(?:a|r-plus|r7b)/^command-(?:a|r-plus|r7b)Cohere Command R
/^command-r/ctx 128k
Cohere Command R
/^command-r/^command-rIBM Granite 3+/4 small-to-mid
/^granite-?[34](?:-\d+)*-(?:2|3|8|30)b/ctx 131.1kout 8.2k
IBM Granite 3+/4 small-to-mid
/^granite-?[34](?:-\d+)*-(?:2|3|8|30)b/^granite-?[34](?:-\d+)*-(?:2|3|8|30)bNVIDIA Nemotron (any size)
/^nemotron/ctx 131.1kout 8.2k
NVIDIA Nemotron (any size)
/^nemotron/^nemotronLlama-Nemotron variants
/^(?:llama-?[34](?:-\d+)*-)?nemotron/ctx 131.1kout 8.2k
Llama-Nemotron variants
/^(?:llama-?[34](?:-\d+)*-)?nemotron/^(?:llama-?[34](?:-\d+)*-)?nemotronAI21 Jamba Large / Mini
/^jamba-?(?:large|2-?large|mini|2-?mini)/ctx 256kout 8.2k
AI21 Jamba Large / Mini
/^jamba-?(?:large|2-?large|mini|2-?mini)/^jamba-?(?:large|2-?large|mini|2-?mini)Amazon Nova Pro/Premier/Lite
/^nova-(?:pro|premier|lite)/ctx 300kout 8.2k
Amazon Nova Pro/Premier/Lite
/^nova-(?:pro|premier|lite)/^nova-(?:pro|premier|lite)Amazon Nova Micro
/^nova-micro/ctx 128k
Amazon Nova Micro
/^nova-micro/^nova-microReka
/^reka-(?:core|flash|edge)/ctx 131.1kout 8.2k
Reka
/^reka-(?:core|flash|edge)/^reka-(?:core|flash|edge)Yi-Coder small
/^yi-coder-(?:1-5|9)b/ctx 131.1kout 8.2k
Yi-Coder small
/^yi-coder-(?:1-5|9)b/^yi-coder-(?:1-5|9)bCohere Aya Expanse 8B / 32B
/^aya-(?:expanse-)?(?:8b|32b)/ctx 131.1kout 8.2k
Cohere Aya Expanse 8B / 32B
/^aya-(?:expanse-)?(?:8b|32b)/^aya-(?:expanse-)?(?:8b|32b)BigCode StarCoder2 (3B/7B/15B) — note: 16k context, smaller than safe default
/^starcoder-?2/ctx 16.4k
BigCode StarCoder2 (3B/7B/15B) — note: 16k context, smaller than safe default
/^starcoder-?2/^starcoder-?2Models not listed here
If copair recognizes the model's family (it matches a classifier rule) but the family isn't a row above, the unset fields fall back to safe defaults — 32k context, 4k output, fenced-block format — while the tier comes from the family rule. Override any field per model via your config — see the configuration reference.
A model copair doesn't recognize at all stops with a clear error asking you to declare at least a tier — see custom & local models. This isn't a comprehensive registry — copair deliberately doesn't try to know about every model; generic logic + this sparse data + your overrides cover the common cases.
Last updated June 2, 2026