Skip to content

Cheatsheet: Choosing your model and the effort dial

FamilyModel IDInput / Output (per MTok)ContextMax output
Opus 4.8 (current flagship)claude-opus-4-8$5 / $251M tok128k tok
Sonnet 4.6claude-sonnet-4-6$3 / $151M tok64k tok
Haiku 4.5claude-haiku-4-5 (alias for claude-haiku-4-5-20251001)$1 / $5200k tok64k tok
Opus 4.7 (legacy, same posture as 4.8)claude-opus-4-7$5 / $251M tok128k tok

Ratios: Sonnet about 1.7x cheaper per output than Opus. Haiku 5x cheaper than Opus.

Opus 4.8 description (verbatim): Anthropic’s most capable model for complex reasoning and agentic coding. Per the docs at drafting; check the Models overview for current.

Sonnet by default (most production workloads).
Opus when the task is genuinely hard.
Haiku when volume matters and the task is light.
Mix-and-match across models inside one app.
GenerationFormWhat it is
4.6 and later (Opus 4.8, Opus 4.7, Sonnet 4.6, Opus 4.6)DatelessIS the pinned snapshot
Pre-4.6 (Haiku 4.5, Sonnet 4.5, Opus 4.5, Opus 4.1)Date-suffixed (canonical)Pinned snapshot
Pre-4.6Dateless (alias)Convenience pointer; can re-resolve

Production: use the pinned form. For 4.6+ that is the dateless ID; for pre-4.6 that is the date-suffixed ID.

client.messages.create(
model="claude-opus-4-8",
max_tokens=4096,
messages=[...],
output_config={"effort": "medium"}, # <-- goes here, not top-level
)
ValueUse case
maxFrontier problems; absolute highest capability. Opus 4.8 / Mythos / Opus 4.7 / Opus 4.6 / Sonnet 4.6
xhighLong-horizon coding and agentic work. Opus 4.8 and Opus 4.7
high (default)Complex reasoning, difficult coding. All supported models
mediumBalanced; the Sonnet 4.6 production default; the Opus 4.8 / 4.7 cost-sensitive default
lowSpeed and lowest cost; simple tasks, subagents

Affects ALL tokens: text response, tool calls, extended thinking. At lower effort, fewer tool calls + terser preambles + combined operations.

Supported on: Opus 4.8, Mythos Preview, Opus 4.7, Opus 4.6, Sonnet 4.6, Opus 4.5. NOT Haiku 4.5.

high and not setting the parameter are equivalent.

ModelProduction defaultNotes
Sonnet 4.6mediumUse low for chat / latency-sensitive; high for complex reasoning
Opus 4.8xhigh (coding / agentic) or high (general)API default is high on all surfaces; medium/low only when evals show quality holds. Docs: “the guidance for Opus 4.7 also applies to Opus 4.8”
Opus 4.7 (legacy, same posture as 4.8)xhigh (coding / agentic) or high (general)medium for cost-sensitive; max only if evals show headroom
Opus 4.6highmedium for cost-sensitive
Haiku 4.5N/A (no effort support)What you get is what the model decides
ModelAdaptive thinkingManual extended thinking
Opus 4.8Recommended (default)NOT supported (returns 400)
Opus 4.7Recommended (default)NOT supported (returns 400)
Sonnet 4.6RecommendedDeprecated but functional
Opus 4.6RecommendedDeprecated but functional
Haiku 4.5NOT supportedSupported (the only mode)

Adaptive: thinking: {type: "adaptive"} (model decides; effort controls depth). Manual: thinking: {type: "enabled", budget_tokens: N, display: "summarized" | "omitted"}.

Response includes thinking content blocks (with a signature field) before text blocks when thinking fires.

Cost shape (worked example, 100k calls/day)

Section titled “Cost shape (worked example, 100k calls/day)”
Feature: 100k calls/day; classifier (500 in + 20 out tokens),
then answer for 70% (500 in + 300 out).
All-Opus: classifier $300 + answer $700 = $1,000/day
All-Sonnet: $180 + $420 = $600/day (~40% cheaper)
Mix (Haiku classifier + Sonnet answer):
$60 + $420 = $480/day (~52% cheaper than all-Opus)

Model + parameter choice both move the dial materially. Lesson 7 (caching) adds the third lever; lesson 12 turns usage logs into dashboards.

FailureRecognize byFix
Defaulting to Opus everywhereHigh bill, similar quality on most callsEval Sonnet against Opus on real prompts; swap where they tie
Using alias in productionSurprise behavior change after Anthropic patchesPin to date-suffixed form on pre-4.6 models
Forgetting effort affects tool callsLoop costs compound mysteriouslyLower effort in inner loop steps; eval the quality trade
Manual thinking on Opus 4.7400 errorSwitch to adaptive thinking + effort
Mixing models without evalSilent quality regression on Haiku stepEval routing decisions; pin per-step model to capability need

What this lesson does NOT cover (and where to find it)

Section titled “What this lesson does NOT cover (and where to find it)”
TopicLands at
Tool use (define + handle)Lesson 4
Server-side tools (web search, code execution)Lesson 5
Model Context ProtocolLesson 6
Prompt caching (the 3rd cost lever)Lesson 7
Agent loop (model + effort compound across steps)Lesson 8 onward
Admin / Usage / Cost APIs (org-level dashboards)Lesson 12
  • Anthropic public Claude docs, Models overview, Extended thinking, Adaptive thinking, Effort, at https://platform.claude.com/docs/. Structural-mirror citation; see references.