Model Capabilities

What copair knows about each model out of the box.

Copair derives most per-model behaviour from generic logic (tier classifier, family-prefix format selection, tier-driven harness defaults). For values that genuinely vary per family — context window, output token limit, native-tool-calling reliability — copair ships the values below.

Honest caveat: values are conservative when uncertain. If copair ships e.g. 16k output for a model that actually supports more, override via model_overrides in your config. See config docs.

Run copair --explain-model <id> for the full resolution trace on any model ID — useful for debugging "why did copair pick X for my model?"

64 families covered·source data on GitHub →

Frontier cloud (closed-source)

Hosted vendor APIs with reliable native tool calling. Copair uses the provider SDK directly rather than text-extraction formatters.

Anthropic Claude Opus (modern)

/^claude-opus/
ctx 200kout 32ktool reliable
pattern^claude-opus
context_window200,000 tokens
max_tokens32,000 tokens
native_tool_callingreliable

Anthropic Claude Sonnet / Haiku (modern)

/^claude-(?:sonnet|haiku)/
ctx 200kout 64ktool reliable
pattern^claude-(?:sonnet|haiku)
context_window200,000 tokens
max_tokens64,000 tokens
native_tool_callingreliable

Anthropic Claude 3.x

/^claude-3/
ctx 200kout 8.2ktool reliable
pattern^claude-3
context_window200,000 tokens
max_tokens8,192 tokens
native_tool_callingreliable

OpenAI GPT-5 Mini

/^gpt-5-mini/
ctx 400kout 4.1ktool reliable
pattern^gpt-5-mini
context_window400,000 tokens
max_tokens4,096 tokens
native_tool_callingreliable

OpenAI GPT-5

/^gpt-5/
ctx 400kout 16.4ktool reliable
pattern^gpt-5
context_window400,000 tokens
max_tokens16,384 tokens
native_tool_callingreliable

OpenAI GPT-4o

/^gpt-4o/
ctx 128kout 16.4ktool reliable
pattern^gpt-4o
context_window128,000 tokens
max_tokens16,384 tokens
native_tool_callingreliable

OpenAI GPT-4

/^gpt-4/
ctx 128kout 8.2ktool reliable
pattern^gpt-4
context_window128,000 tokens
max_tokens8,192 tokens
native_tool_callingreliable

OpenAI o-series

/^o[134](?:-mini|-pro)?\b/
ctx 200kout 32ktool reliable
pattern^o[134](?:-mini|-pro)?\b
context_window200,000 tokens
max_tokens32,000 tokens
native_tool_callingreliable

Google Gemini 2.5+

/^gemini-(?:2-5|3)/
ctx 1Mout 65.5ktool reliable
pattern^gemini-(?:2-5|3)
context_window1,000,000 tokens
max_tokens65,536 tokens
native_tool_callingreliable

Google Gemini 2.x

/^gemini-2/
ctx 1Mout 8.2ktool reliable
pattern^gemini-2
context_window1,000,000 tokens
max_tokens8,192 tokens
native_tool_callingreliable

xAI Grok 4

/^grok-4/
ctx 256kout 16.4ktool reliable
pattern^grok-4
context_window256,000 tokens
max_tokens16,384 tokens
native_tool_callingreliable

xAI Grok 1-3

/^grok-[1-3]/
ctx 131.1kout 8.2ktool reliable
pattern^grok-[1-3]
context_window131,072 tokens
max_tokens8,192 tokens
native_tool_callingreliable

Moonshot Kimi K2

/^kimi-k2/
ctx 200kout 16.4k
pattern^kimi-k2
context_window200,000 tokens
max_tokens16,384 tokens

MiniMax M1

/^minimax-m1/
ctx 1Mout 16.4k
pattern^minimax-m1
context_window1,000,000 tokens
max_tokens16,384 tokens

MiniMax M2+

/^minimax-m[2-9]/
ctx 200kout 16.4k
pattern^minimax-m[2-9]
context_window200,000 tokens
max_tokens16,384 tokens

gpt-oss (open-weight)

/^gpt-?oss-?(?:20|120)b/
ctx 131.1kout 8.2k
pattern^gpt-?oss-?(?:20|120)b
context_window131,072 tokens
max_tokens8,192 tokens

Frontier open-weight (large)

Large open-weight models (typically 30B+ parameters). Tool calling tends to be unreliable; copair uses text-extraction with qwen-xml or dsml format depending on family.

Qwen3-Coder 480B

/^qwen3-coder-480b/
ctx 262.1kout 32.8k
pattern^qwen3-coder-480b
context_window262,144 tokens
max_tokens32,768 tokens

Qwen3 235B

/^qwen3-(?:vl-)?235b/
ctx 262.1kout 16.4k
pattern^qwen3-(?:vl-)?235b
context_window262,144 tokens
max_tokens16,384 tokens

Qwen3-Next 80B

/^qwen3-next-80b/
ctx 262.1kout 16.4k
pattern^qwen3-next-80b
context_window262,144 tokens
max_tokens16,384 tokens

Alibaba Qwen-Plus / Qwen-Turbo (1M context, Qwen3+ defaults)

/^qwen-(?:plus|turbo)/
ctx 1Mout 8.2k
pattern^qwen-(?:plus|turbo)
context_window1,000,000 tokens
max_tokens8,192 tokens

Alibaba Qwen3-Max

/^qwen3-max/
ctx 262.1kout 32.8k
pattern^qwen3-max
context_window262,144 tokens
max_tokens32,768 tokens

Mistral / Pixtral Large

/^(?:mistral|pixtral)-large/
ctx 131.1kout 8.2k
pattern^(?:mistral|pixtral)-large
context_window131,072 tokens
max_tokens8,192 tokens

Small open-weight (≤ ~14B)

Small open-weight models. The small-model harness auto-engages: max-turn cap, ask_user / task_complete tools, per-turn format reminders.

Qwen3-Coder 30B

/^qwen3-coder-30b/
ctx 262.1kout 16.4k
pattern^qwen3-coder-30b
context_window262,144 tokens
max_tokens16,384 tokens

Qwen3 32B/30B

/^qwen3-(?:vl-)?(?:30b-a3b|32b)/
ctx 131.1kout 8.2k
pattern^qwen3-(?:vl-)?(?:30b-a3b|32b)
context_window131,072 tokens
max_tokens8,192 tokens

Qwen2/2.5 32B/72B

/^qwen2(?:-5)?-(?:coder-)?(?:32b|72b)/
ctx 131.1kout 8.2k
pattern^qwen2(?:-5)?-(?:coder-)?(?:32b|72b)
context_window131,072 tokens
max_tokens8,192 tokens

Qwen3 small (0.6B–14B)

/^qwen3-(?:vl-)?(?:0-6|1-7|4|8|14)b/
ctx 131.1kout 8.2k
pattern^qwen3-(?:vl-)?(?:0-6|1-7|4|8|14)b
context_window131,072 tokens
max_tokens8,192 tokens

Qwen2/2.5 7B/14B

/^qwen2(?:-5)?-(?:coder-)?(?:7|14)b/
ctx 131.1kout 8.2k
pattern^qwen2(?:-5)?-(?:coder-)?(?:7|14)b
context_window131,072 tokens
max_tokens8,192 tokens

DeepSeek frontier (V3.x / R1)

/^deepseek-(?:v[34]|r[12])(?!.*-distill)/
ctx 131.1kout 8.2k
pattern^deepseek-(?:v[34]|r[12])(?!.*-distill)
context_window131,072 tokens
max_tokens8,192 tokens

DeepSeek API alias

/^deepseek-(?:chat|reasoner)/
ctx 131.1kout 8.2k
pattern^deepseek-(?:chat|reasoner)
context_window131,072 tokens
max_tokens8,192 tokens

DeepSeek R1 distill ≤8B

/^deepseek-r1.*?-(?:1-5|7|8)b/
ctx 131.1kout 8.2k
pattern^deepseek-r1.*?-(?:1-5|7|8)b
context_window131,072 tokens
max_tokens8,192 tokens

DeepSeek R1 distill ≥14B

/^deepseek-r1.*?-(?:14|32|70)b/
ctx 131.1kout 8.2k
pattern^deepseek-r1.*?-(?:14|32|70)b
context_window131,072 tokens
max_tokens8,192 tokens

DeepSeek Coder 1.3B

/^deepseek-coder-1-3b/
ctx 16.4k
pattern^deepseek-coder-1-3b
context_window16,384 tokens

Llama 4 Scout (10M context — longest open-weight as of 2025)

/^llama-?4-scout/
ctx 10Mout 8.2k
pattern^llama-?4-scout
context_window10,000,000 tokens
max_tokens8,192 tokens

Llama 4 Maverick

/^llama-?4-maverick/
ctx 1Mout 8.2k
pattern^llama-?4-maverick
context_window1,000,000 tokens
max_tokens8,192 tokens

Llama 3.x large (70B+)

/^llama-?[34](?:-\d+)*-(?:70b|72b|90b|405b)/
ctx 131.1kout 8.2k
pattern^llama-?[34](?:-\d+)*-(?:70b|72b|90b|405b)
context_window131,072 tokens
max_tokens8,192 tokens

Llama 3.x small

/^llama-?[34](?:-\d+)*-(?:1b|3b|7b|8b|11b)/
ctx 131.1kout 8.2k
pattern^llama-?[34](?:-\d+)*-(?:1b|3b|7b|8b|11b)
context_window131,072 tokens
max_tokens8,192 tokens

Mistral Codestral

/^codestral/
ctx 256kout 16.4k
pattern^codestral
context_window256,000 tokens
max_tokens16,384 tokens

Mistral Magistral (reasoning model)

/^magistral-(?:medium|small)/
ctx 131.1kout 16.4k
pattern^magistral-(?:medium|small)
context_window131,072 tokens
max_tokens16,384 tokens

Mistral Pixtral 12B (vision)

/^pixtral-12b/
ctx 128kout 8.2k
pattern^pixtral-12b
context_window128,000 tokens
max_tokens8,192 tokens

Mistral Medium / Small 3+

/^mistral-(?:medium|small-[34])/
ctx 131.1kout 8.2k
pattern^mistral-(?:medium|small-[34])
context_window131,072 tokens
max_tokens8,192 tokens

Mistral Nemo 12B

/^mistral-nemo/
ctx 131.1kout 8.2k
pattern^mistral-nemo
context_window131,072 tokens
max_tokens8,192 tokens

Mixtral 8x22B

/^mixtral-8x22b/
ctx 65.5kout 8.2k
pattern^mixtral-8x22b
context_window65,536 tokens
max_tokens8,192 tokens

Mixtral 8x7B

/^mixtral-8x7b/
ctx 32.8k
pattern^mixtral-8x7b
context_window32,768 tokens

Ministral

/^ministral-(?:3|7|14)b/
ctx 131.1kout 8.2k
pattern^ministral-(?:3|7|14)b
context_window131,072 tokens
max_tokens8,192 tokens

GLM 4.5+

/^glm-(?:[5-9]|4-[5-9])/
ctx 131.1kout 8.2k
pattern^glm-(?:[5-9]|4-[5-9])
context_window131,072 tokens
max_tokens8,192 tokens

GLM-4 9B

/^glm-4-9b/
ctx 131.1kout 8.2k
pattern^glm-4-9b
context_window131,072 tokens
max_tokens8,192 tokens

Phi-4 small

/^phi-?4-(?:mini|multimodal)/
ctx 131.1k
pattern^phi-?4-(?:mini|multimodal)
context_window131,072 tokens

Phi-4 14B

/^phi-?4(?:-14b)?\b/
ctx 131.1kout 8.2k
pattern^phi-?4(?:-14b)?\b
context_window131,072 tokens
max_tokens8,192 tokens

Phi-3 small

/^phi-?3(?:-5)?-(?:mini|small)/
ctx 131.1k
pattern^phi-?3(?:-5)?-(?:mini|small)
context_window131,072 tokens

Gemma 3

/^gemma-?[3]-?(?:1b|2b|4b|9b|12b|27b)/
ctx 131.1kout 8.2k
pattern^gemma-?[3]-?(?:1b|2b|4b|9b|12b|27b)
context_window131,072 tokens
max_tokens8,192 tokens

Gemma 2 large (8k context — predates extended-context release)

/^gemma-?2-?(?:9b|27b)/
ctx 8.2kout 8.2k
pattern^gemma-?2-?(?:9b|27b)
context_window8,192 tokens
max_tokens8,192 tokens

Gemma 2 small (8k context)

/^gemma-?2-?(?:2b|4b)/
ctx 8.2k
pattern^gemma-?2-?(?:2b|4b)
context_window8,192 tokens

Cohere Command R+ / R7B / A

/^command-(?:a|r-plus|r7b)/
ctx 128k
pattern^command-(?:a|r-plus|r7b)
context_window128,000 tokens

Cohere Command R

/^command-r/
ctx 128k
pattern^command-r
context_window128,000 tokens

IBM Granite 3+/4 small-to-mid

/^granite-?[34](?:-\d+)*-(?:2|3|8|30)b/
ctx 131.1kout 8.2k
pattern^granite-?[34](?:-\d+)*-(?:2|3|8|30)b
context_window131,072 tokens
max_tokens8,192 tokens

NVIDIA Nemotron (any size)

/^nemotron/
ctx 131.1kout 8.2k
pattern^nemotron
context_window131,072 tokens
max_tokens8,192 tokens

Llama-Nemotron variants

/^(?:llama-?[34](?:-\d+)*-)?nemotron/
ctx 131.1kout 8.2k
pattern^(?:llama-?[34](?:-\d+)*-)?nemotron
context_window131,072 tokens
max_tokens8,192 tokens

AI21 Jamba Large / Mini

/^jamba-?(?:large|2-?large|mini|2-?mini)/
ctx 256kout 8.2k
pattern^jamba-?(?:large|2-?large|mini|2-?mini)
context_window256,000 tokens
max_tokens8,192 tokens

Amazon Nova Pro/Premier/Lite

/^nova-(?:pro|premier|lite)/
ctx 300kout 8.2k
pattern^nova-(?:pro|premier|lite)
context_window300,000 tokens
max_tokens8,192 tokens

Amazon Nova Micro

/^nova-micro/
ctx 128k
pattern^nova-micro
context_window128,000 tokens

Reka

/^reka-(?:core|flash|edge)/
ctx 131.1kout 8.2k
pattern^reka-(?:core|flash|edge)
context_window131,072 tokens
max_tokens8,192 tokens

Yi-Coder small

/^yi-coder-(?:1-5|9)b/
ctx 131.1kout 8.2k
pattern^yi-coder-(?:1-5|9)b
context_window131,072 tokens
max_tokens8,192 tokens

Cohere Aya Expanse 8B / 32B

/^aya-(?:expanse-)?(?:8b|32b)/
ctx 131.1kout 8.2k
pattern^aya-(?:expanse-)?(?:8b|32b)
context_window131,072 tokens
max_tokens8,192 tokens

BigCode StarCoder2 (3B/7B/15B) — note: 16k context, smaller than safe default

/^starcoder-?2/
ctx 16.4k
pattern^starcoder-?2
context_window16,384 tokens

Models not listed here

If copair recognizes the model's family (it matches a classifier rule) but the family isn't a row above, the unset fields fall back to safe defaults — 32k context, 4k output, fenced-block format — while the tier comes from the family rule. Override any field per model via your config — see the configuration reference.

A model copair doesn't recognize at all stops with a clear error asking you to declare at least a tier — see custom & local models. This isn't a comprehensive registry — copair deliberately doesn't try to know about every model; generic logic + this sparse data + your overrides cover the common cases.

Last updated June 2, 2026