Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

MiniMax: MiniMax M3 vs OpenAI: GPT-5.4 Nano

Summary

MiniMax M3 vs GPT-5.4 Nano benchmark comparison: MiniMax M3 leads on average score with 7.6 vs 7.5. GPT-5.4 Nano has the lower benchmark cost at $0.107 vs $0.131. GPT-5.4 Nano is faster at 11.95s vs 68.17s, with pass rates of 65.1% vs 63.5%.

Recommended model: GPT-5.4 Nano - Its score stays close to the best score here (7.5 vs 7.6), while responding about 5.7x faster than MiniMax M3.

Last updated at: 2026-06-12

Metric MiniMax M3 MiniMax M3 medium Release: 2026-06-01 GPT-5.4 Nano GPT-5.4 Nano medium Release: 2026-03-17
Score 7.6 7.5
Rank #43 #48
Reliability 9.6 10.0
Consistency 7.9 8.4
Tests Correct
Attempt pass rate 65.1% 63.5%
Flaky tests 5 4
Total Runs 63 63
Cost per result 1.187 0.969
Total Cost $0.131 $0.107
Input Price $0.300 / 1M $0.200 / 1M
Output Price $1.200 / 1M $1.250 / 1M
Total Input Tokens 46,546 35,434
Output Tokens 49,036 3,014
Reasoning Tokens 92,543 76,520
Response Time (avg) 68.17s 11.95s
Response Time (max) 431.03s 94.06s
Response Time (total) 1363.38s 250.98s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#43 MiniMax M3

medium
Cost
$0.012
Time
154.4s
Tokens
10,018 tok

#48 GPT-5.4 Nano

medium
Cost
$0.007
Time
24.6s
Tokens
4,943 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M3 5.5 3.7 66.7% 3 14.95s 2,526 874 3,414
GPT-5.4 Nano 8.3 10.0 75.0% 0 4.52s 606 683 2,254
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M3 6.1 6.5 55.6% 1 144.74s 5,804 6,223 32,667
GPT-5.4 Nano 6.1 4.7 66.7% 2 19.12s 7,305 516 20,778
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M3 10.0 10.0 100.0% 0 65.30s 14,760 1,306 6,253
GPT-5.4 Nano 9.8 10.0 100.0% 0 24.13s 12,345 349 5,719
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M3 10.0 10.0 100.0% 0 14.92s 8,088 514 3,164
GPT-5.4 Nano 10.0 10.0 100.0% 0 2.54s 7,140 234 516
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M3 5.5 9.3 33.3% 0 233.13s 869 16,254 19,070
GPT-5.4 Nano 5.9 7.2 55.6% 1 38.18s 619 60 43,325
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M3 5.1 3.4 33.3% 1 33.25s 954 2,487 2,523
GPT-5.4 Nano 4.5 10.0 0.0% 0 4.15s 477 179 443
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M3 9.8 10.0 100.0% 0 6.14s 1,623 103 920
GPT-5.4 Nano 9.8 10.0 100.0% 0 1.88s 660 95 521
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M3 7.9 9.9 66.7% 0 49.91s 2,079 11,946 13,761
GPT-5.4 Nano 4.1 7.2 22.2% 1 3.79s 642 594 1,408
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M3 10.0 10.0 100.0% 0 11.91s 9,168 281 555
GPT-5.4 Nano 10.0 10.0 100.0% 0 7.71s 5,445 234 382
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M3 3.0 10.0 0.0% 0 100.80s 675 9,048 10,216
GPT-5.4 Nano 3.0 10.0 0.0% 0 4.81s 195 70 1,174

Quick Compare

Switch Comparison Pair