#39

Claude Sonnet 4.6

Anthropic · Release: 2026-02-17 · anthropic/claude-sonnet-4.6::none

7.3

Consistency

9.6

$0.252

Total Output Tokens

6,910

Input Price

$3.000 / 1M

Output Price

$15.000 / 1M

Wrong Tests: 7

Attempt pass rate: 62.8%

Flaky tests

1

Flaky tests had mixed outcomes across runs (at least one pass and one fail).

Response Time (avg)

5.12s

Response Time (max): 23.84s

Response Time (total): 51.16s

Extra formatting: 3 Wrong answer: 3 Did not follow instructions: 1

Charts

Choose the first model, then click a second model to open a side-by-side page.

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Quick Compare

Claude Sonnet 4.6nonevsGPT-5.2medium Claude Sonnet 4.6nonevsSeed-2.0-Minimedium Claude Sonnet 4.6nonevsKimi K2.5medium Claude Sonnet 4.6nonevsGPT-5.4 Nanomedium Claude Sonnet 4.6nonevsGrok 4.20medium Claude Sonnet 4.6nonevsGemini 3 Flash Previewmedium Claude Sonnet 4.6nonevsGemini 3.1 Pro Previewmedium Claude Sonnet 4.6nonevsQwen3.6 Plus Previewmedium

Category Breakdown

Category	Score	Consistency	Tests Correct
Anti-AI Tricks	4.8	10.0
Combined	9.5	10.0
Data parsing and extraction	10.0	10.0
Domain specific	7.7	10.0
General Intelligence	6.1	3.1
Instructions following	6.5	10.0
Puzzle Solving	7.7	10.0
Tool Calling	10.0	10.0

Compared models