Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

Anthropic: Claude Sonnet 5 vs North Mini Code

Summary

Claude Sonnet 5 vs North Mini Code benchmark comparison: North Mini Code leads on average score with 5.8 vs 5.7. North Mini Code has the lower benchmark cost at $0.000 vs $0.287. Claude Sonnet 5 is faster at 4.74s vs 106.18s, with pass rates of 42.9% vs 50.8%.

Recommended model: Claude Sonnet 5 - Its score stays close to the best score here (5.7 vs 5.8), while responding about 22.4x faster than North Mini Code.

Last updated at: 2026-06-30

Metric Claude Sonnet 5 Claude Sonnet 5 none Release: 2026-06-30 North Mini Code North Mini Code medium Release: 2026-06-18 Free Available
Score 5.7 5.8
Rank #117 #109
Reliability 10.0 8.5
Consistency 8.6 8.5
Tests Correct
Attempt pass rate 42.9% 50.8%
Flaky tests 4 4
Total Runs 63 55
Cost per result 4.098 0.000
Total Cost $0.287 $0.000
Input Price $2.000 / 1M $0.000 / 1M
Output Price $10.000 / 1M $0.000 / 1M
Total Input Tokens 76,797 32,891
Output Tokens 13,325 424,772
Reasoning Tokens 0 1,021,489
Response Time (avg) 4.74s 106.18s
Response Time (max) 29.46s 357.05s
Response Time (total) 99.46s 2229.70s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#117 Claude Sonnet 5

none
Cost
$0.061
Time
53.7s
Tokens
6,172 tok

#109 North Mini Code

medium
Cost
$0.000
Time
51.8s
Tokens
12,460 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Claude Sonnet 5 5.3 10.0 25.0% 0 3.60s 834 1,813 0
North Mini Code 8.4 10.0 75.0% 0 64.79s 324 64,441 68,535
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Claude Sonnet 5 4.6 7.9 22.2% 1 3.67s 10,590 1,864 0
North Mini Code 4.5 4.9 33.3% 2 320.43s 7,119 219,891 561,569
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Claude Sonnet 5 3.0 10.0 0.0% 0 29.46s 38,775 6,340 0
North Mini Code 2.8 1.6 33.3% 1 323.07s 14,760 0 151,500
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Claude Sonnet 5 10.0 10.0 100.0% 0 3.01s 10,503 309 0
North Mini Code 10.0 10.0 100.0% 0 24.06s 6,819 240 2,659
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Claude Sonnet 5 5.3 7.2 44.4% 1 3.28s 975 933 0
North Mini Code 5.3 7.2 44.4% 1 71.37s 621 8,483 104,079
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Claude Sonnet 5 4.7 3.1 33.3% 1 2.81s 708 272 0
North Mini Code 5.1 10.0 0.0% 0 25.08s 444 1,546 1,635
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Claude Sonnet 5 6.4 10.0 50.0% 0 2.58s 909 103 0
North Mini Code 9.8 10.0 100.0% 0 15.43s 379 909 1,339
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Claude Sonnet 5 6.0 7.4 55.6% 1 3.22s 894 778 0
North Mini Code 3.3 10.0 0.0% 0 19.70s 543 2,215 2,485
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Claude Sonnet 5 10.0 10.0 100.0% 0 6.80s 12,351 522 0
North Mini Code 10.0 10.0 100.0% 0 3.93s 1,776 41 563
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Claude Sonnet 5 3.0 10.0 0.0% 0 4.31s 258 391 0
North Mini Code 3.0 10.0 0.0% 0 305.02s 106 127,006 127,125

Quick Compare

Switch Comparison Pair