
AI BENCHY Compare

North Mini Code vs Mistral: Mistral Small 4

Summary

North Mini Code vs Mistral Small 4 benchmark comparison: the average scores are effectively tied at 5.1 each. North Mini Code has the lower benchmark cost ($0.000 vs $0.068). Mistral Small 4 is faster, averaging 9.40s per response vs 29.82s, and has the higher attempt pass rate (44.4% vs 19.1%).

Recommended model: Mistral Small 4 - It matches North Mini Code's score (5.1) while responding about 3.2x faster.
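The "about 3.2x faster" figure can be checked from the two average response times reported on this page. The snippet below is a minimal sketch of that arithmetic; the `speedup` helper and the variable names are illustrative and not part of the benchmark itself.

```python
# Average response times in seconds, copied from the comparison on this page.
avg_time_s = {"North Mini Code": 29.82, "Mistral Small 4": 9.40}


def speedup(slower: float, faster: float) -> float:
    """Return how many times quicker the faster model responds (hypothetical helper)."""
    return slower / faster


ratio = speedup(avg_time_s["North Mini Code"], avg_time_s["Mistral Small 4"])
print(f"Mistral Small 4 responds about {ratio:.1f}x faster")
```

The ratio is taken over average response times only; it says nothing about the maximum or total times, which are listed separately.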

Last updated at: 2026-06-18

North Mini Code: none, Release: 2026-06-18, Free Available
Mistral Small 4: medium, Release: 2026-03-16

Metric                 North Mini Code   Mistral Small 4
Score                  5.1               5.1
Rank                   #131              #133
Reliability            8.5               10.0
Consistency            9.9               6.9
Attempt pass rate      19.1%             44.4%
Flaky tests            0                 8
Total Runs             57                63
Cost per result        0.000             1.344
Total Cost             $0.000            $0.068
Input Price            $0.000 / 1M       $0.150 / 1M
Output Price           $0.000 / 1M       $0.600 / 1M
Total Input Tokens     43,264            42,576
Output Tokens          8,278             24,184
Reasoning Tokens       0                 84,678
Response Time (avg)    29.82s            9.40s
Response Time (max)    159.85s           59.15s
Response Time (total)  626.26s           197.39s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#131 North Mini Code (none): Cost $0.000, Time 266.1s, Tokens 63,551 tok

#133 Mistral Small 4 (medium): Cost $0.006, Time 47.9s, Tokens 9,857 tok

Charts: Top Models by Score, Score vs Total Cost, Response Time (avg), Score vs Response Time (avg), Total Output Tokens, Score vs Total Output Tokens

Category Breakdown

Category                     Model            Score  Consistency  Attempt pass rate  Flaky tests  Response Time (avg)  Input Tokens  Output Tokens  Reasoning Tokens
Anti-AI Tricks               North Mini Code  3.0    10.0         0.0%               0            22.48s               402           4,075          0
Anti-AI Tricks               Mistral Small 4  5.6    3.8          66.7%              3            2.67s                708           4,055          4,778
Coding                       North Mini Code  3.9    10.0         0.0%               0            21.96s               7,119         504            0
Coding                       Mistral Small 4  4.4    5.1          33.3%              2            39.98s               7,636         11,635         54,715
Combined                     North Mini Code  3.5    8.7          0.0%               0            159.85s              24,265        2,920          0
Combined                     Mistral Small 4  3.0    10.0         0.0%               0            25.25s               18,706        2,612          10,700
Data parsing and extraction  North Mini Code  10.0   10.0         100.0%             0            28.00s               6,819         183            0
Data parsing and extraction  Mistral Small 4  7.3    5.9          83.3%              1            1.23s                6,171         335            723
Domain specific              North Mini Code  3.0    10.0         0.0%               0            14.73s               621           14             0
Domain specific              Mistral Small 4  5.3    7.2          44.4%              1            6.11s                742           2,621          6,904
General Intelligence         North Mini Code  3.9    9.6          0.0%               0            34.77s               444           115            0
General Intelligence         Mistral Small 4  4.8    10.0         0.0%               0            2.05s                519           821            828
Instructions following       North Mini Code  6.5    10.0         50.0%              0            30.68s               597           57             0
Instructions following       Mistral Small 4  7.3    5.8          83.3%              1            1.38s                729           540            1,031
Puzzle Solving               North Mini Code  3.5    10.0         0.0%               0            24.43s               435           353            0
Puzzle Solving               Mistral Small 4  3.4    9.7          0.0%               0            2.17s                735           1,226          2,632
Tool Calling                 North Mini Code  9.5    10.0         100.0%             0            3.64s                2,403         51             0
Tool Calling                 Mistral Small 4  10.0   10.0         100.0%             0            3.50s                6,420         321            810
Trivia                       North Mini Code  3.0    10.0         0.0%               0            37.37s               159           6              0
Trivia                       Mistral Small 4  3.0    10.0         0.0%               0            5.92s                210           18             1,557
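
Because the overall scores are tied, it can help to tally which model leads in each category. The following is a minimal sketch using the per-category scores listed above; the `count_leads` helper and the dictionary layout are illustrative assumptions, not something the benchmark site provides.

```python
# Per-category scores as (North Mini Code, Mistral Small 4), copied from the breakdown above.
category_scores = {
    "Anti-AI Tricks": (3.0, 5.6),
    "Coding": (3.9, 4.4),
    "Combined": (3.5, 3.0),
    "Data parsing and extraction": (10.0, 7.3),
    "Domain specific": (3.0, 5.3),
    "General Intelligence": (3.9, 4.8),
    "Instructions following": (6.5, 7.3),
    "Puzzle Solving": (3.5, 3.4),
    "Tool Calling": (9.5, 10.0),
    "Trivia": (3.0, 3.0),
}


def count_leads(scores: dict[str, tuple[float, float]]) -> dict[str, int]:
    """Count the categories each model leads, plus exact ties (hypothetical helper)."""
    leads = {"North Mini Code": 0, "Mistral Small 4": 0, "tie": 0}
    for north, mistral in scores.values():
        if north > mistral:
            leads["North Mini Code"] += 1
        elif mistral > north:
            leads["Mistral Small 4"] += 1
        else:
            leads["tie"] += 1
    return leads


leads = count_leads(category_scores)
print(leads)
```

A simple count treats every category equally and ignores the size of each gap, so it is only a rough complement to the overall score.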
