Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Google: Gemma 4 26B A4B vs MiniMax: MiniMax M3

Last updated at: 2026-06-01

Metric Gemma 4 26B A4B Gemma 4 26B A4B medium Release: 2026-04-03 Free Available MiniMax M3 MiniMax M3 medium Release: 2026-06-01
Score 7.8 7.5
Rank #33 #54
Reliability 10.0 9.6
Consistency 9.2 8.4
Tests Correct
Attempt pass rate 73.3% 75.0%
Flaky tests 2 4
Total Runs 60 60
Cost per result 0.317 1.083
Total Cost $0.038 $0.120
Input Price $0.060 / 1M $0.300 / 1M
Output Price $0.330 / 1M $1.200 / 1M
Output Tokens 28,000 46,884
Reasoning Tokens 82,045 85,935
Response Time (avg) 50.92s 68.44s
Response Time (max) 369.32s 431.03s
Response Time (total) 967.47s 1300.32s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 6.20s 1,142 3,045
MiniMax M3 5.5 3.7 66.7% 3 14.95s 874 3,414
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 2.9 10.0 0.0% 0 258.40s 14,838 26,122
MiniMax M3 8.3 10.0 100.0% 0 185.58s 4,071 26,059
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 9.6 10.0 100.0% 0 73.55s 5,415 13,112
MiniMax M3 10.0 10.0 100.0% 0 65.30s 1,306 6,253
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 16.51s 1,567 2,827
MiniMax M3 10.0 10.0 100.0% 0 14.92s 514 3,164
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 2.9 4.4 22.2% 2 23.62s 2,469 7,105
MiniMax M3 6.5 10.0 66.7% 0 233.13s 16,254 19,070
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 29.76s 25 5,075
MiniMax M3 5.1 3.4 33.3% 1 33.25s 2,487 2,523
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 17.54s 887 4,470
MiniMax M3 9.8 10.0 100.0% 0 6.14s 103 920
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 5.79s 410 2,128
MiniMax M3 7.9 9.9 66.7% 0 49.91s 11,946 13,761
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 9.01s 450 1,256
MiniMax M3 10.0 10.0 100.0% 0 11.91s 281 555
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 3.0 10.0 0.0% 0 180.87s 797 16,905
MiniMax M3 3.0 10.0 0.0% 0 100.80s 9,048 10,216

Quick Compare

Switch Comparison Pair