Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

MiniMax: MiniMax M2.7 vs OpenAI: GPT-4o-mini

Summary

MiniMax M2.7 vs GPT-4o-mini benchmark comparison: MiniMax M2.7 leads on average score with 5.2 vs 5.0. GPT-4o-mini has the lower benchmark cost at $0.006 vs $0.075. GPT-4o-mini is faster at 1.77s vs 38.18s, with pass rates of 46.0% vs 23.8%.

Recommended model: GPT-4o-mini - Its score stays close to the best score here (5.0 vs 5.2), while costing about 12.6x less than MiniMax M2.7.

Last updated at: 2026-07-02

Metric MiniMax M2.7 MiniMax M2.7 medium Release: 2026-03-18 GPT-4o-mini GPT-4o-mini none Release: 2024-07-18
Score 5.2 5.0
Rank #132 #144
Reliability 10.0 10.0
Consistency 6.8 9.9
Tests Correct
Attempt pass rate 46.0% 23.8%
Flaky tests 8 0
Total Runs 63 63
Cost per result 2.494 0.119
Total Cost $0.075 $0.006
Input Price $0.180 / 1M $0.150 / 1M
Output Price $0.720 / 1M $0.600 / 1M
Total Input Tokens 34,371 31,518
Output Tokens 8,981 1,982
Reasoning Tokens 89,812 0
Response Time (avg) 38.18s 1.77s
Response Time (max) 196.21s 7.58s
Response Time (total) 763.60s 24.80s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#132 MiniMax M2.7

medium
Cost
$0.022
Time
22.8s
Tokens
9,250 tok

#144 GPT-4o-mini

none
Cost
$0.001
Time
6.6s
Tokens
742 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M2.7 7.9 6.3 83.3% 2 40.32s 654 3,010 17,716
GPT-4o-mini 4.8 10.0 25.0% 0 1.34s 618 186 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M2.7 5.7 9.1 33.3% 0 101.89s 2,961 1,231 38,841
GPT-4o-mini 3.2 9.6 0.0% 0 1.63s 7,314 367 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M2.7 4.7 1.6 66.7% 1 41.03s 14,233 369 4,480
GPT-4o-mini 3.0 10.0 0.0% 0 7.58s 8,298 568 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M2.7 6.3 5.8 66.7% 1 21.95s 7,152 187 5,882
GPT-4o-mini 10.0 10.0 100.0% 0 1.27s 7,161 183 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M2.7 3.0 10.0 0.0% 0 19.00s 245 8 2,796
GPT-4o-mini 3.0 10.0 0.0% 0 637ms 732 15 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M2.7 3.9 2.5 33.3% 1 38.70s 486 92 5,204
GPT-4o-mini 4.0 10.0 0.0% 0 909ms 480 66 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M2.7 3.8 5.8 33.3% 1 12.80s 687 350 2,600
GPT-4o-mini 6.3 10.0 50.0% 0 1.11s 666 69 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M2.7 5.9 7.2 55.6% 1 24.87s 675 362 7,840
GPT-4o-mini 3.5 10.0 0.0% 0 1.21s 651 308 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M2.7 4.7 1.6 66.7% 1 12.05s 7,067 304 1,001
GPT-4o-mini 10.0 10.0 100.0% 0 2.51s 5,400 205 0
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
MiniMax M2.7 3.0 10.0 0.0% 0 22.77s 211 3,068 3,452
GPT-4o-mini 3.0 10.0 0.0% 0 794ms 198 15 0

Quick Compare

Switch Comparison Pair