Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

MiniMax: MiniMax M2.7 vs Mistral: Mistral Small 4

Last updated at: 2026-05-01

Metric MiniMax M2.7 MiniMax M2.7 medium Release: 2026-03-18 Mistral Small 4 Mistral Small 4 none Release: 2026-03-16
Score 5.3 5.2
Rank #110 #115
Reliability N/A N/A
Consistency 5.5 9.5
Tests Correct
Attempt pass rate 51.9% 31.5%
Flaky tests 10 1
Total Runs 54 54
Cost per result 2.273 0.118
Total Cost $0.091 $0.006
Input Price $0.300 / 1M $0.150 / 1M
Output Price $1.200 / 1M $0.600 / 1M
Output Tokens 4,984 2,207
Reasoning Tokens 62,787 0
Response Time (avg) 31.08s 665ms
Response Time (max) 117.04s 1.72s
Response Time (total) 528.37s 11.97s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
MiniMax M2.7 7.9 6.3 83.3% 2 40.32s 3,010 17,716
Mistral Small 4 3.4 7.9 16.7% 1 395ms 182 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
MiniMax M2.7 10.0 10.0 100.0% 0 91.27s 467 15,175
Mistral Small 4 4.5 9.0 0.0% 0 1.28s 583 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
MiniMax M2.7 4.7 1.6 66.7% 1 41.03s 369 4,480
Mistral Small 4 3.0 10.0 0.0% 0 1.72s 496 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
MiniMax M2.7 6.3 5.8 66.7% 1 21.95s 187 5,882
Mistral Small 4 10.0 10.0 100.0% 0 822ms 261 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
MiniMax M2.7 3.0 10.0 0.0% 0 19.00s 8 2,796
Mistral Small 4 5.3 10.0 33.3% 0 367ms 28 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
MiniMax M2.7 3.9 2.5 33.3% 1 38.70s 92 5,204
Mistral Small 4 4.0 10.0 0.0% 0 729ms 205 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
MiniMax M2.7 3.8 1.6 50.0% 2 12.64s 213 2,457
Mistral Small 4 6.5 10.0 50.0% 0 380ms 69 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
MiniMax M2.7 3.8 4.5 33.3% 2 25.62s 334 8,076
Mistral Small 4 3.1 9.9 0.0% 0 589ms 170 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
MiniMax M2.7 4.7 1.6 66.7% 1 12.05s 304 1,001
Mistral Small 4 10.0 10.0 100.0% 0 1.40s 213 0

Quick Compare

Switch Comparison Pair