Navigate
AI BENCHY
Your ad here

AI BENCHY Compare

Mistral: Mistral Small 4 vs Laguna M.1

Last updated at: 2026-04-29

Metric Mistral Small 4 Mistral Small 4 medium Release: 2026-03-16 Laguna M.1 Laguna M.1 none Release: 2026-04-28 Free Available
Score 5.7 5.1
Rank #96 #117
Reliability N/A 9.9
Consistency 6.8 8.7
Tests Correct
Attempt pass rate 50.0% 33.3%
Flaky tests 7 3
Total Runs 54 54
Cost per result 0.674 0.000
Total Cost $0.034 $0.000
Input Price $0.150 / 1M $0.000 / 1M
Output Price $0.600 / 1M $0.000 / 1M
Output Tokens 15,084 2,870
Reasoning Tokens 39,408 0
Response Time (avg) 5.64s 2.79s
Response Time (max) 30.49s 15.42s
Response Time (total) 101.52s 50.24s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Mistral Small 4 5.6 3.8 66.7% 3 2.67s 4,055 4,778
Laguna M.1 3.4 7.9 16.7% 1 1.23s 485 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Mistral Small 4 6.7 3.5 66.7% 1 30.49s 2,796 11,296
Laguna M.1 7.5 3.8 66.7% 1 2.93s 543 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Mistral Small 4 3.0 10.0 0.0% 0 25.25s 2,612 10,700
Laguna M.1 3.0 10.0 0.0% 0 4.32s 622 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Mistral Small 4 7.3 5.9 83.3% 1 1.23s 335 723
Laguna M.1 10.0 10.0 100.0% 0 3.37s 246 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Mistral Small 4 5.3 7.2 44.4% 1 6.11s 2,621 6,904
Laguna M.1 3.6 7.2 22.2% 1 5.50s 33 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Mistral Small 4 4.8 10.0 0.0% 0 2.05s 821 828
Laguna M.1 4.0 10.0 0.0% 0 3.08s 212 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Mistral Small 4 7.3 5.8 83.3% 1 1.38s 540 1,031
Laguna M.1 6.3 10.0 50.0% 0 683ms 80 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Mistral Small 4 3.4 9.7 0.0% 0 2.00s 983 2,338
Laguna M.1 3.2 10.0 0.0% 0 951ms 340 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Mistral Small 4 10.0 10.0 100.0% 0 3.50s 321 810
Laguna M.1 10.0 10.0 100.0% 0 7.54s 309 0

Quick Compare

Switch Comparison Pair