Model Capabilities

What copair knows about each model out of the box.

Copair derives most per-model behaviour from generic logic (tier classifier, family-prefix format selection, tier-driven harness defaults). For values that genuinely vary per family — context window, output token limit, native-tool-calling reliability — copair ships the values below.

Honest caveat: values are conservative when uncertain. If copair ships e.g. 16k output for a model that actually supports more, override via model_overrides in your config. See config docs.

Run copair --explain-model <id> for the full resolution trace on any model ID — useful for debugging "why did copair pick X for my model?"

64 families covered·source data on GitHub →

Frontier cloud (closed-source)

Hosted vendor APIs with reliable native tool calling. Copair uses the provider SDK directly rather than text-extraction formatters.

Anthropic Claude Opus (modern)

/^claude-opus/

ctx 200kout 32ktool reliable

pattern^claude-opus

context_window200,000 tokens

max_tokens32,000 tokens

native_tool_callingreliable

Anthropic Claude Sonnet / Haiku (modern)

/^claude-(?:sonnet|haiku)/

ctx 200kout 64ktool reliable

pattern^claude-(?:sonnet|haiku)

context_window200,000 tokens

max_tokens64,000 tokens

native_tool_callingreliable

Anthropic Claude 3.x

/^claude-3/

ctx 200kout 8.2ktool reliable

pattern^claude-3

context_window200,000 tokens

max_tokens8,192 tokens

native_tool_callingreliable

OpenAI GPT-5 Mini

/^gpt-5-mini/

ctx 400kout 4.1ktool reliable

pattern^gpt-5-mini

context_window400,000 tokens

max_tokens4,096 tokens

native_tool_callingreliable

OpenAI GPT-5

/^gpt-5/

ctx 400kout 16.4ktool reliable

pattern^gpt-5

context_window400,000 tokens

max_tokens16,384 tokens

native_tool_callingreliable

OpenAI GPT-4o

/^gpt-4o/

ctx 128kout 16.4ktool reliable

pattern^gpt-4o

context_window128,000 tokens

max_tokens16,384 tokens

native_tool_callingreliable

OpenAI GPT-4

/^gpt-4/

ctx 128kout 8.2ktool reliable

pattern^gpt-4

context_window128,000 tokens

max_tokens8,192 tokens

native_tool_callingreliable

OpenAI o-series

/^o[134](?:-mini|-pro)?\b/

ctx 200kout 32ktool reliable

pattern^o[134](?:-mini|-pro)?\b

context_window200,000 tokens

max_tokens32,000 tokens

native_tool_callingreliable

Google Gemini 2.5+

/^gemini-(?:2-5|3)/

ctx 1Mout 65.5ktool reliable

pattern^gemini-(?:2-5|3)

context_window1,000,000 tokens

max_tokens65,536 tokens

native_tool_callingreliable

Google Gemini 2.x

/^gemini-2/

ctx 1Mout 8.2ktool reliable

pattern^gemini-2

context_window1,000,000 tokens

max_tokens8,192 tokens

native_tool_callingreliable

xAI Grok 4

/^grok-4/

ctx 256kout 16.4ktool reliable

pattern^grok-4

context_window256,000 tokens

max_tokens16,384 tokens

native_tool_callingreliable

xAI Grok 1-3

/^grok-[1-3]/

ctx 131.1kout 8.2ktool reliable

pattern^grok-[1-3]

context_window131,072 tokens

max_tokens8,192 tokens

native_tool_callingreliable

Moonshot Kimi K2

/^kimi-k2/

ctx 200kout 16.4k

pattern^kimi-k2

context_window200,000 tokens

max_tokens16,384 tokens

MiniMax M1

/^minimax-m1/

ctx 1Mout 16.4k

pattern^minimax-m1

context_window1,000,000 tokens

max_tokens16,384 tokens

MiniMax M2+

/^minimax-m[2-9]/

ctx 200kout 16.4k

pattern^minimax-m[2-9]

context_window200,000 tokens

max_tokens16,384 tokens

gpt-oss (open-weight)

/^gpt-?oss-?(?:20|120)b/

ctx 131.1kout 8.2k

pattern^gpt-?oss-?(?:20|120)b

context_window131,072 tokens

max_tokens8,192 tokens

Frontier open-weight (large)

Large open-weight models (typically 30B+ parameters). Tool calling tends to be unreliable; copair uses text-extraction with qwen-xml or dsml format depending on family.

Qwen3-Coder 480B

/^qwen3-coder-480b/

ctx 262.1kout 32.8k

pattern^qwen3-coder-480b

context_window262,144 tokens

max_tokens32,768 tokens

Qwen3 235B

/^qwen3-(?:vl-)?235b/

ctx 262.1kout 16.4k

pattern^qwen3-(?:vl-)?235b

context_window262,144 tokens

max_tokens16,384 tokens

Qwen3-Next 80B

/^qwen3-next-80b/

ctx 262.1kout 16.4k

pattern^qwen3-next-80b

context_window262,144 tokens

max_tokens16,384 tokens

Alibaba Qwen-Plus / Qwen-Turbo (1M context, Qwen3+ defaults)

/^qwen-(?:plus|turbo)/

ctx 1Mout 8.2k

pattern^qwen-(?:plus|turbo)

context_window1,000,000 tokens

max_tokens8,192 tokens

Alibaba Qwen3-Max

/^qwen3-max/

ctx 262.1kout 32.8k

pattern^qwen3-max

context_window262,144 tokens

max_tokens32,768 tokens

Mistral / Pixtral Large

/^(?:mistral|pixtral)-large/

ctx 131.1kout 8.2k

pattern^(?:mistral|pixtral)-large

context_window131,072 tokens

max_tokens8,192 tokens

Small open-weight (≤ ~14B)

Small open-weight models. The small-model harness auto-engages: max-turn cap, ask_user / task_complete tools, per-turn format reminders.

Qwen3-Coder 30B

/^qwen3-coder-30b/

ctx 262.1kout 16.4k

pattern^qwen3-coder-30b

context_window262,144 tokens

max_tokens16,384 tokens

Qwen3 32B/30B

/^qwen3-(?:vl-)?(?:30b-a3b|32b)/

ctx 131.1kout 8.2k

pattern^qwen3-(?:vl-)?(?:30b-a3b|32b)

context_window131,072 tokens

max_tokens8,192 tokens

Qwen2/2.5 32B/72B

/^qwen2(?:-5)?-(?:coder-)?(?:32b|72b)/

ctx 131.1kout 8.2k

pattern^qwen2(?:-5)?-(?:coder-)?(?:32b|72b)

context_window131,072 tokens

max_tokens8,192 tokens

Qwen3 small (0.6B–14B)

/^qwen3-(?:vl-)?(?:0-6|1-7|4|8|14)b/

ctx 131.1kout 8.2k

pattern^qwen3-(?:vl-)?(?:0-6|1-7|4|8|14)b

context_window131,072 tokens

max_tokens8,192 tokens

Qwen2/2.5 7B/14B

/^qwen2(?:-5)?-(?:coder-)?(?:7|14)b/

ctx 131.1kout 8.2k

pattern^qwen2(?:-5)?-(?:coder-)?(?:7|14)b

context_window131,072 tokens

max_tokens8,192 tokens

DeepSeek frontier (V3.x / R1)

/^deepseek-(?:v[34]|r[12])(?!.*-distill)/

ctx 131.1kout 8.2k

pattern^deepseek-(?:v[34]|r[12])(?!.*-distill)

context_window131,072 tokens

max_tokens8,192 tokens

DeepSeek API alias

/^deepseek-(?:chat|reasoner)/

ctx 131.1kout 8.2k

pattern^deepseek-(?:chat|reasoner)

context_window131,072 tokens

max_tokens8,192 tokens

DeepSeek R1 distill ≤8B

/^deepseek-r1.*?-(?:1-5|7|8)b/

ctx 131.1kout 8.2k

pattern^deepseek-r1.*?-(?:1-5|7|8)b

context_window131,072 tokens

max_tokens8,192 tokens

DeepSeek R1 distill ≥14B

/^deepseek-r1.*?-(?:14|32|70)b/

ctx 131.1kout 8.2k

pattern^deepseek-r1.*?-(?:14|32|70)b

context_window131,072 tokens

max_tokens8,192 tokens

DeepSeek Coder 1.3B

/^deepseek-coder-1-3b/

ctx 16.4k

pattern^deepseek-coder-1-3b

context_window16,384 tokens

Llama 4 Scout (10M context — longest open-weight as of 2025)

/^llama-?4-scout/

ctx 10Mout 8.2k

pattern^llama-?4-scout

context_window10,000,000 tokens

max_tokens8,192 tokens

Llama 4 Maverick

/^llama-?4-maverick/

ctx 1Mout 8.2k

pattern^llama-?4-maverick

context_window1,000,000 tokens

max_tokens8,192 tokens

Llama 3.x large (70B+)

/^llama-?[34](?:-\d+)*-(?:70b|72b|90b|405b)/

ctx 131.1kout 8.2k

pattern^llama-?[34](?:-\d+)*-(?:70b|72b|90b|405b)

context_window131,072 tokens

max_tokens8,192 tokens

Llama 3.x small

/^llama-?[34](?:-\d+)*-(?:1b|3b|7b|8b|11b)/

ctx 131.1kout 8.2k

pattern^llama-?[34](?:-\d+)*-(?:1b|3b|7b|8b|11b)

context_window131,072 tokens

max_tokens8,192 tokens

Mistral Codestral

/^codestral/

ctx 256kout 16.4k

pattern^codestral

context_window256,000 tokens

max_tokens16,384 tokens

Mistral Magistral (reasoning model)

/^magistral-(?:medium|small)/

ctx 131.1kout 16.4k

pattern^magistral-(?:medium|small)

context_window131,072 tokens

max_tokens16,384 tokens

Mistral Pixtral 12B (vision)

/^pixtral-12b/

ctx 128kout 8.2k

pattern^pixtral-12b

context_window128,000 tokens

max_tokens8,192 tokens

Mistral Medium / Small 3+

/^mistral-(?:medium|small-[34])/

ctx 131.1kout 8.2k

pattern^mistral-(?:medium|small-[34])

context_window131,072 tokens

max_tokens8,192 tokens

Mistral Nemo 12B

/^mistral-nemo/

ctx 131.1kout 8.2k

pattern^mistral-nemo

context_window131,072 tokens

max_tokens8,192 tokens

Mixtral 8x22B

/^mixtral-8x22b/

ctx 65.5kout 8.2k

pattern^mixtral-8x22b

context_window65,536 tokens

max_tokens8,192 tokens

Mixtral 8x7B

/^mixtral-8x7b/

ctx 32.8k

pattern^mixtral-8x7b

context_window32,768 tokens

Ministral

/^ministral-(?:3|7|14)b/

ctx 131.1kout 8.2k

pattern^ministral-(?:3|7|14)b

context_window131,072 tokens

max_tokens8,192 tokens

GLM 4.5+

/^glm-(?:[5-9]|4-[5-9])/

ctx 131.1kout 8.2k

pattern^glm-(?:[5-9]|4-[5-9])

context_window131,072 tokens

max_tokens8,192 tokens

GLM-4 9B

/^glm-4-9b/

ctx 131.1kout 8.2k

pattern^glm-4-9b

context_window131,072 tokens

max_tokens8,192 tokens

Phi-4 small

/^phi-?4-(?:mini|multimodal)/

ctx 131.1k

pattern^phi-?4-(?:mini|multimodal)

context_window131,072 tokens

Phi-4 14B

/^phi-?4(?:-14b)?\b/

ctx 131.1kout 8.2k

pattern^phi-?4(?:-14b)?\b

context_window131,072 tokens

max_tokens8,192 tokens

Phi-3 small

/^phi-?3(?:-5)?-(?:mini|small)/

ctx 131.1k

pattern^phi-?3(?:-5)?-(?:mini|small)

context_window131,072 tokens

Gemma 3

/^gemma-?[3]-?(?:1b|2b|4b|9b|12b|27b)/

ctx 131.1kout 8.2k

pattern^gemma-?[3]-?(?:1b|2b|4b|9b|12b|27b)

context_window131,072 tokens

max_tokens8,192 tokens

Gemma 2 large (8k context — predates extended-context release)

/^gemma-?2-?(?:9b|27b)/

ctx 8.2kout 8.2k

pattern^gemma-?2-?(?:9b|27b)

context_window8,192 tokens

max_tokens8,192 tokens

Gemma 2 small (8k context)

/^gemma-?2-?(?:2b|4b)/

ctx 8.2k

pattern^gemma-?2-?(?:2b|4b)

context_window8,192 tokens

Cohere Command R+ / R7B / A

/^command-(?:a|r-plus|r7b)/

ctx 128k

pattern^command-(?:a|r-plus|r7b)

context_window128,000 tokens

Cohere Command R

/^command-r/

ctx 128k

pattern^command-r

context_window128,000 tokens

IBM Granite 3+/4 small-to-mid

/^granite-?[34](?:-\d+)*-(?:2|3|8|30)b/

ctx 131.1kout 8.2k

pattern^granite-?[34](?:-\d+)*-(?:2|3|8|30)b

context_window131,072 tokens

max_tokens8,192 tokens

NVIDIA Nemotron (any size)

/^nemotron/

ctx 131.1kout 8.2k

pattern^nemotron

context_window131,072 tokens

max_tokens8,192 tokens

Llama-Nemotron variants

/^(?:llama-?[34](?:-\d+)*-)?nemotron/

ctx 131.1kout 8.2k

pattern^(?:llama-?[34](?:-\d+)*-)?nemotron

context_window131,072 tokens

max_tokens8,192 tokens

AI21 Jamba Large / Mini

/^jamba-?(?:large|2-?large|mini|2-?mini)/

ctx 256kout 8.2k

pattern^jamba-?(?:large|2-?large|mini|2-?mini)

context_window256,000 tokens

max_tokens8,192 tokens

Amazon Nova Pro/Premier/Lite

/^nova-(?:pro|premier|lite)/

ctx 300kout 8.2k

pattern^nova-(?:pro|premier|lite)

context_window300,000 tokens

max_tokens8,192 tokens

Amazon Nova Micro

/^nova-micro/

ctx 128k

pattern^nova-micro

context_window128,000 tokens

Reka

/^reka-(?:core|flash|edge)/

ctx 131.1kout 8.2k

pattern^reka-(?:core|flash|edge)

context_window131,072 tokens

max_tokens8,192 tokens

Yi-Coder small

/^yi-coder-(?:1-5|9)b/

ctx 131.1kout 8.2k

pattern^yi-coder-(?:1-5|9)b

context_window131,072 tokens

max_tokens8,192 tokens

Cohere Aya Expanse 8B / 32B

/^aya-(?:expanse-)?(?:8b|32b)/

ctx 131.1kout 8.2k

pattern^aya-(?:expanse-)?(?:8b|32b)

context_window131,072 tokens

max_tokens8,192 tokens

BigCode StarCoder2 (3B/7B/15B) — note: 16k context, smaller than safe default

/^starcoder-?2/

ctx 16.4k

pattern^starcoder-?2

context_window16,384 tokens

Models not listed here

If copair recognizes the model's family (it matches a classifier rule) but the family isn't a row above, the unset fields fall back to safe defaults — 32k context, 4k output, fenced-block format — while the tier comes from the family rule. Override any field per model via your config — see the configuration reference.

A model copair doesn't recognize at all stops with a clear error asking you to declare at least a tier — see custom & local models. This isn't a comprehensive registry — copair deliberately doesn't try to know about every model; generic logic + this sparse data + your overrides cover the common cases.

Last updated July 1, 2026