AI BENCHY Compare

MiniMax: MiniMax M3 vs Qwen: Qwen3.7 Plus

Summary

MiniMax M3 vs Qwen3.7 Plus benchmark comparison: MiniMax M3 leads on average score (7.6 vs 7.2) and on attempt pass rate (65.1% vs 47.6%). Qwen3.7 Plus has the lower total benchmark cost ($0.023 vs $0.131) and the faster average response time (2.85s vs 68.17s).

Recommended model: Qwen3.7 Plus - Its score stays close to the best score here (7.2 vs 7.6), while costing about 5.9x less than MiniMax M3.
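
The trade-off behind this recommendation can be checked with a few lines of arithmetic. The sketch below uses only the rounded figures printed on this page; the function and variable names are illustrative and not part of AI BENCHY. From the rounded totals the cost ratio comes out at about 5.7x, so the 5.9x quoted above is presumably computed from unrounded costs (an assumption, since the page does not show them).

```python
# Rounded figures as printed in the summary; names here are illustrative only.
SCORES = {"MiniMax M3": 7.6, "Qwen3.7 Plus": 7.2}
TOTAL_COST_USD = {"MiniMax M3": 0.131, "Qwen3.7 Plus": 0.023}


def cost_ratio(expensive: str, cheap: str) -> float:
    """How many times more the expensive model cost over the whole benchmark."""
    return TOTAL_COST_USD[expensive] / TOTAL_COST_USD[cheap]


def score_gap(leader: str, other: str) -> float:
    """Difference in average score between the two models."""
    return SCORES[leader] - SCORES[other]


ratio = cost_ratio("MiniMax M3", "Qwen3.7 Plus")
gap = score_gap("MiniMax M3", "Qwen3.7 Plus")
print(f"cost ratio: {ratio:.1f}x")
print(f"score gap: {gap:.1f}")
```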

Last updated at: 2026-06-12

Metric                  MiniMax M3 (medium)   Qwen3.7 Plus (none)
Release                 2026-06-01            2026-06-03
Score                   7.6                   7.2
Rank                    #43                   #61
Reliability             9.6                   10.0
Consistency             7.9                   10.0
Attempt pass rate       65.1%                 47.6%
Flaky tests             5                     0
Total Runs              63                    63
Cost per result         1.187                 0.276
Total Cost              $0.131                $0.023
Input Price             $0.300 / 1M           $0.320 / 1M
Output Price            $1.200 / 1M           $1.280 / 1M
Total Input Tokens      46,546                42,510
Output Tokens           49,036                6,578
Reasoning Tokens        92,543                0
Response Time (avg)     68.17s                2.85s
Response Time (max)     431.03s               29.38s
Response Time (total)   1363.38s              59.86s
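
The attempt pass rates can be turned back into approximate attempt counts. This is a minimal sketch that assumes the pass rate is simply passed attempts divided by Total Runs; the page does not state how the rate is computed, so treat the counts as implied rather than reported. The names are illustrative.

```python
# Values copied from the table; the passes/runs interpretation is an assumption.
TOTAL_RUNS = 63
PASS_RATE_PCT = {"MiniMax M3": 65.1, "Qwen3.7 Plus": 47.6}


def implied_passes(rate_pct: float, runs: int) -> int:
    """Nearest whole number of passing attempts for a given rate and run count."""
    return round(rate_pct / 100 * runs)


passes = {model: implied_passes(rate, TOTAL_RUNS) for model, rate in PASS_RATE_PCT.items()}

for model, n in passes.items():
    # Recompute the percentage to confirm it rounds back to the published figure.
    recomputed = round(100 * n / TOTAL_RUNS, 1)
    print(f"{model}: {n}/{TOTAL_RUNS} attempts passed ({recomputed}%)")
```

Under that assumption the rates correspond to 41 and 30 passing attempts out of 63, and both recomputed percentages round back to the published values.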

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

Model                        Cost     Time     Tokens
#43 MiniMax M3 (medium)      $0.012   154.4s   10,018 tok
#61 Qwen3.7 Plus (none)      $0.019   213.5s   11,960 tok

Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens.

Category Breakdown

Category                     Model         Score  Consistency  Attempt pass rate  Flaky tests  Response Time (avg)  Input Tokens  Output Tokens  Reasoning Tokens
Anti-AI Tricks               MiniMax M3    5.5    3.7          66.7%              3            14.95s               2,526         874            3,414
Anti-AI Tricks               Qwen3.7 Plus  6.5    10.0         50.0%              0            1.38s                696           349            0
Coding                       MiniMax M3    6.1    6.5          55.6%              1            144.74s              5,804         6,223          32,667
Coding                       Qwen3.7 Plus  5.5    10.0         33.3%              0            2.15s                7,911         639            0
Combined                     MiniMax M3    10.0   10.0         100.0%             0            65.30s               14,760        1,306          6,253
Combined                     Qwen3.7 Plus  10.0   10.0         100.0%             0            29.38s               14,952        4,505          0
Data parsing and extraction  MiniMax M3    10.0   10.0         100.0%             0            14.92s               8,088         514            3,164
Data parsing and extraction  Qwen3.7 Plus  10.0   10.0         100.0%             0            1.43s                7,794         243            0
Domain specific              MiniMax M3    5.5    9.3          33.3%              0            233.13s              869           16,254         19,070
Domain specific              Qwen3.7 Plus  3.0    10.0         0.0%               0            868ms                789           18             0
General Intelligence         MiniMax M3    5.1    3.4          33.3%              1            33.25s               954           2,487          2,523
General Intelligence         Qwen3.7 Plus  5.3    10.0         0.0%               0            1.33s                522           78             0
Instructions following       MiniMax M3    9.8    10.0         100.0%             0            6.14s                1,623         103            920
Instructions following       Qwen3.7 Plus  6.3    10.0         50.0%              0            929ms                711           72             0
Puzzle Solving               MiniMax M3    7.9    9.9          66.7%              0            49.91s               2,079         11,946         13,761
Puzzle Solving               Qwen3.7 Plus  7.7    10.0         66.7%              0            1.71s                714           443            0
Tool Calling                 MiniMax M3    10.0   10.0         100.0%             0            11.91s               9,168         281            555
Tool Calling                 Qwen3.7 Plus  10.0   10.0         100.0%             0            3.54s                8,211         222            0
Trivia                       MiniMax M3    3.0    10.0         0.0%               0            100.80s              675           9,048          10,216
Trivia                       Qwen3.7 Plus  3.0    10.0         0.0%               0            1.21s                210           9              0
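
The per-category scores can be tallied to see where each model leads. The following is a small sketch using only the Score column from the breakdown; the tally is an illustrative summary and not a metric published on this page.

```python
# (MiniMax M3 score, Qwen3.7 Plus score) per category, copied from the breakdown.
CATEGORY_SCORES = {
    "Anti-AI Tricks": (5.5, 6.5),
    "Coding": (6.1, 5.5),
    "Combined": (10.0, 10.0),
    "Data parsing and extraction": (10.0, 10.0),
    "Domain specific": (5.5, 3.0),
    "General Intelligence": (5.1, 5.3),
    "Instructions following": (9.8, 6.3),
    "Puzzle Solving": (7.9, 7.7),
    "Tool Calling": (10.0, 10.0),
    "Trivia": (3.0, 3.0),
}


def tally_wins(scores: dict) -> dict:
    """Count categories won by each model, with equal scores counted as ties."""
    wins = {"MiniMax M3": 0, "Qwen3.7 Plus": 0, "tie": 0}
    for minimax, qwen in scores.values():
        if minimax > qwen:
            wins["MiniMax M3"] += 1
        elif qwen > minimax:
            wins["Qwen3.7 Plus"] += 1
        else:
            wins["tie"] += 1
    return wins


wins = tally_wins(CATEGORY_SCORES)
print(wins)
```

On these figures MiniMax M3 leads in four categories, Qwen3.7 Plus in two, and four are tied. The largest gaps in MiniMax M3's favor are Instructions following and Domain specific.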
