AI BENCHY Compare

Qwen: Qwen3.6 Max Preview vs Xiaomi: MiMo-V2-Flash

Last updated at: 2026-04-27

| Metric | Qwen3.6 Max Preview (none) | MiMo-V2-Flash (medium) |
| --- | --- | --- |
| Release | 2026-04-20 | 2025-12-16 |
| Score | 7.3 | 7.5 |
| Rank | #56 | #53 |
| Reliability | 10.0 | N/A |
| Consistency | 8.7 | 8.6 |
| Tests Correct | | |
| Attempt pass rate | 66.7% | 70.4% |
| Flaky tests | 3 | 3 |
| Total Runs | 54 | 54 |
| Cost per result | 0.827 | 0.341 |
| Total Cost | $0.083 | $0.038 |
| Input Price | $1.300 / 1M | $0.090 / 1M |
| Output Price | $7.800 / 1M | $0.290 / 1M |
| Output Tokens | 4,732 | 12,387 |
| Reasoning Tokens | 0 | 115,182 |
| Response Time (avg) | 3.38s | 23.36s |
| Response Time (max) | 20.51s | 96.01s |
| Response Time (total) | 60.83s | 280.34s |
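
The summary figures above can be cross-checked with a little arithmetic. What follows is a minimal Python sketch, not the site's own method: it assumes that "Attempt pass rate" means passing attempts divided by "Total Runs", which the page does not define. The `Summary` class and the helper names are hypothetical and exist only for illustration; the numbers are copied from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Summary:
    """Hypothetical container for a few summary metrics from the table."""
    name: str
    score: float
    pass_rate_pct: float    # "Attempt pass rate"
    total_runs: int         # "Total Runs"
    total_cost_usd: float   # "Total Cost"
    avg_response_s: float   # "Response Time (avg)"


def passing_attempts(m: Summary) -> int:
    # Assumption: pass rate = passing attempts / total runs.
    # Rounding absorbs the one-decimal precision of the listed percentage.
    return round(m.pass_rate_pct / 100 * m.total_runs)


def ratio(a: float, b: float) -> float:
    # Plain quotient, used to say "a is N times b".
    return a / b


qwen = Summary("Qwen3.6 Max Preview", 7.3, 66.7, 54, 0.083, 3.38)
mimo = Summary("MiMo-V2-Flash", 7.5, 70.4, 54, 0.038, 23.36)

if __name__ == "__main__":
    for m in (qwen, mimo):
        print(f"{m.name}: ~{passing_attempts(m)} of {m.total_runs} attempts passed")
    cost = ratio(qwen.total_cost_usd, mimo.total_cost_usd)
    time = ratio(mimo.avg_response_s, qwen.avg_response_s)
    print(f"Total cost, Qwen vs MiMo: {cost:.2f}x")
    print(f"Avg response time, MiMo vs Qwen: {time:.2f}x")
```

Under that assumption, the listed rates correspond to roughly 36 and 38 passing attempts out of 54. On the listed totals, the Qwen run cost about 2.18 times as much as the MiMo run, while MiMo's average response took about 6.91 times as long.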

Charts (not reproduced here): Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens.

Category Breakdown

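| Anti-AI Tricks | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 5.2 | 7.9 | 41.7% | 1 | | 2.63s | 513 | 0 |
| MiMo-V2-Flash | 8.1 | 7.9 | 83.3% | 1 | | 15.85s | 1,674 | 23,559 |

| Coding | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 5.0 | 2.0 | 66.7% | 1 | | 3.45s | 426 | 0 |
| MiMo-V2-Flash | 4.7 | 1.6 | 66.7% | 1 | | 13.03s | 428 | 3,648 |

| Combined | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 3.0 | 10.0 | 0.0% | 0 | | 20.51s | 2,842 | 0 |
| MiMo-V2-Flash | 9.8 | 10.0 | 100.0% | 0 | | 75.68s | 442 | 26,859 |

| Data parsing and extraction | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 2.87s | 243 | 0 |
| MiMo-V2-Flash | 6.5 | 10.0 | 50.0% | 0 | | 0ms | 153 | 0 |

| Domain specific | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 7.7 | 10.0 | 66.7% | 0 | | 1.22s | 18 | 0 |
| MiMo-V2-Flash | 5.9 | 7.2 | 55.6% | 1 | | 96.01s | 8,374 | 42,461 |

| General Intelligence | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 4.3 | 10.0 | 0.0% | 0 | | 1.62s | 76 | 0 |
| MiMo-V2-Flash | 4.0 | 10.0 | 0.0% | 0 | | 4.20s | 87 | 488 |

| Instructions following | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 8.4 | 6.9 | 83.3% | 1 | | 1.45s | 69 | 0 |
| MiMo-V2-Flash | 10.0 | 10.0 | 100.0% | 0 | | 4.28s | 75 | 3,504 |

| Puzzle Solving | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 2.38s | 323 | 0 |
| MiMo-V2-Flash | 7.7 | 10.0 | 66.7% | 0 | | 3.77s | 833 | 1,948 |

| Tool Calling | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 5.27s | 222 | 0 |
| MiMo-V2-Flash | 10.0 | 10.0 | 100.0% | 0 | | 27.78s | 321 | 12,715 |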
