Navigate
AI BENCHY
Your ad here

AI BENCHY Compare

Qwen: Qwen3.6 Max Preview vs Xiaomi: MiMo-V2-Pro

Last updated at: 2026-05-01

Metric Qwen3.6 Max Preview Qwen3.6 Max Preview none Release: 2026-04-20 MiMo-V2-Pro MiMo-V2-Pro medium Release: 2026-03-18
Score 7.5 8.1
Rank #52 #33
Reliability 10.0 N/A
Consistency 9.1 8.6
Tests Correct
Attempt pass rate 68.5% 77.8%
Flaky tests 2 3
Total Runs 54 48
Cost per result 0.752 1.320
Total Cost $0.083 $0.159
Input Price $1.040 / 1M $1.000 / 1M
Output Price $6.240 / 1M $3.000 / 1M
Output Tokens 4,732 2,360
Reasoning Tokens 0 38,320
Response Time (avg) 3.38s 12.27s
Response Time (max) 20.51s 64.71s
Response Time (total) 60.83s 208.56s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 Max Preview 5.2 7.9 41.7% 1 2.63s 513 0
MiMo-V2-Pro 10.0 10.0 100.0% 0 3.06s 223 1,107
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 Max Preview 5.0 2.0 66.7% 1 3.45s 426 0
MiMo-V2-Pro 10.0 10.0 100.0% 0 52.12s 485 11,361
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 Max Preview 3.0 10.0 0.0% 0 20.51s 2,842 0
MiMo-V2-Pro 4.7 1.6 66.7% 1 64.71s 380 14,186
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 Max Preview 10.0 10.0 100.0% 0 2.87s 243 0
MiMo-V2-Pro 7.3 5.8 83.3% 1 17.20s 260 7,484
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 Max Preview 7.7 10.0 66.7% 0 1.22s 18 0
MiMo-V2-Pro 5.3 10.0 33.3% 0 6.00s 155 1,048
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 Max Preview 4.3 10.0 0.0% 0 1.62s 76 0
MiMo-V2-Pro 10.0 10.0 100.0% 0 4.06s 198 424
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 Max Preview 9.8 10.0 100.0% 0 1.45s 69 0
MiMo-V2-Pro 9.9 10.0 100.0% 0 3.36s 83 667
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 Max Preview 10.0 10.0 100.0% 0 2.38s 323 0
MiMo-V2-Pro 7.0 7.2 55.6% 1 4.71s 313 1,179
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 Max Preview 10.0 10.0 100.0% 0 5.27s 222 0
MiMo-V2-Pro 10.0 10.0 100.0% 0 8.19s 263 864

Quick Compare

Switch Comparison Pair