#40
Anthropic
Release: 2026-02-17
Tested on: 2026-05-08 13:33
anthropic/claude-sonnet-4.6::medium
(medium)
(none)
Input Price
$3.000 / 1M
Output Price
$15.000 / 1M
Flaky tests
1
Flaky tests had mixed outcomes across runs (at least one pass and one fail).
Charts
Choose the first model, then click a second model to open a side-by-side page.
Score vs Total Cost
Response Time (avg)
Score vs Response Time (avg)
Total Output Tokens
Score vs Total Output Tokens
Quick Compare
Claude Sonnet 4.6mediumvsQwen3.6 PlusmediumClaude Sonnet 4.6mediumvsGemini 3.1 Flash Lite PreviewlowClaude Sonnet 4.6mediumvsGemini 3.1 Flash LitemediumClaude Sonnet 4.6mediumvsQwen3.5-122B-A10BmediumClaude Sonnet 4.6mediumvsGPT-5.4mediumClaude Sonnet 4.6mediumvsMiMo-V2.5mediumClaude Sonnet 4.6mediumvsGemini 3 Flash PreviewmediumClaude Sonnet 4.6mediumvsGemini 3.5 FlashlowClaude Sonnet 4.6mediumvsRing-2.6-1Tmedium
Category Breakdown
| Category | Score | Consistency | Tests Correct |
|---|---|---|---|
| Anti-AI Tricks | 6.5 | 10.0 | |
| Coding | 10.0 | 10.0 | |
| Combined | 10.0 | 10.0 | |
| Data parsing and extraction | 10.0 | 10.0 | |
| Domain specific | 2.9 | 7.2 | |
| General Intelligence | 10.0 | 10.0 | |
| Instructions following | 10.0 | 10.0 | |
| Puzzle Solving | 10.0 | 10.0 | |
| Tool Calling | 10.0 | 10.0 | |
| Trivia | 3.0 | 10.0 |