AI BENCHY Compare

Google: Gemini 3.1 Flash Lite vs Xiaomi: MiMo-V2-Pro

Last updated at: 2026-05-08

Metric                 Gemini 3.1 Flash Lite (low)  MiMo-V2-Pro (medium)
Release                2026-05-08                   2026-03-18
Score                  7.6                          7.7
Rank                   #44                          #37
Reliability            10.0                         9.4
Consistency            9.2                          8.2
Tests Correct
Attempt pass rate      68.4%                        77.2%
Flaky tests            2                            4
Total Runs             57                           57
Cost per result        0.203                        1.767
Total Cost             $0.025                       $0.212
Input Price            $0.250 / 1M                  $1.000 / 1M
Output Price           $1.500 / 1M                  $3.000 / 1M
Output Tokens          2,702                        2,514
Reasoning Tokens       8,596                        55,816
Response Time (avg)    1.92s                        16.18s
Response Time (max)    5.66s                        82.71s
Response Time (total)  36.49s                       307.48s
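
To put the summary figures side by side, the ratios between the two columns can be computed directly. The following is a minimal Python sketch, not part of the benchmark itself: the dictionary keys (`total_cost_usd`, `avg_response_s`, `reasoning_tokens`, `score`) and the helper `ratios` are names chosen here for illustration, and the values are copied from the summary figures above.

```python
# Summary figures copied from the comparison above.
# Key names are illustrative assumptions, not fields exposed by the site.
summary = {
    "Gemini 3.1 Flash Lite": {
        "score": 7.6,
        "total_cost_usd": 0.025,
        "avg_response_s": 1.92,
        "reasoning_tokens": 8596,
    },
    "MiMo-V2-Pro": {
        "score": 7.7,
        "total_cost_usd": 0.212,
        "avg_response_s": 16.18,
        "reasoning_tokens": 55816,
    },
}


def ratios(numerator, denominator):
    """Divide each metric of one model by the same metric of the other."""
    return {metric: numerator[metric] / denominator[metric] for metric in numerator}


mimo_vs_gemini = ratios(summary["MiMo-V2-Pro"], summary["Gemini 3.1 Flash Lite"])

for metric, value in mimo_vs_gemini.items():
    print(f"{metric}: {value:.2f}x")
```

On these figures, MiMo-V2-Pro comes out at roughly 8.5 times the total cost and 8.4 times the average response time of Gemini 3.1 Flash Lite, for an overall score that is 0.1 higher.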

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

Category                     Model                  Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Anti-AI Tricks               Gemini 3.1 Flash Lite  7.3    6.2          75.0%              2                           1.84s                1,013          1,548
Anti-AI Tricks               MiMo-V2-Pro            10.0   10.0         100.0%             0                           2.86s                251            1,154
Coding                       Gemini 3.1 Flash Lite  10.0   10.0         100.0%             0                           1.46s                441            408
Coding                       MiMo-V2-Pro            10.0   10.0         100.0%             0                           52.12s               485            11,361
Combined                     Gemini 3.1 Flash Lite  3.0    10.0         0.0%               0                           4.48s                348            975
Combined                     MiMo-V2-Pro            4.7    1.6          66.7%              1                           64.71s               380            14,186
Data parsing and extraction  Gemini 3.1 Flash Lite  10.0   10.0         100.0%             0                           1.44s                291            697
Data parsing and extraction  MiMo-V2-Pro            7.3    5.8          83.3%              1                           17.20s               260            7,484
Domain specific              Gemini 3.1 Flash Lite  5.3    10.0         33.3%              0                           1.52s                15             1,214
Domain specific              MiMo-V2-Pro            5.3    10.0         33.3%              0                           8.82s                170            2,158
General Intelligence         Gemini 3.1 Flash Lite  4.0    10.0         0.0%               0                           1.37s                69             438
General Intelligence         MiMo-V2-Pro            10.0   10.0         100.0%             0                           4.92s                184            400
Instructions following       Gemini 3.1 Flash Lite  10.0   10.0         100.0%             0                           1.52s                72             760
Instructions following       MiMo-V2-Pro            9.9    10.0         100.0%             0                           3.36s                83             667
Puzzle Solving               Gemini 3.1 Flash Lite  10.0   10.0         100.0%             0                           1.40s                210            1,191
Puzzle Solving               MiMo-V2-Pro            6.4    4.4          77.8%              2                           5.26s                410            1,700
Tool Calling                 Gemini 3.1 Flash Lite  10.0   10.0         100.0%             0                           5.66s                234            945
Tool Calling                 MiMo-V2-Pro            10.0   10.0         100.0%             0                           8.19s                263            864
Trivia                       Gemini 3.1 Flash Lite  3.0    10.0         0.0%               0                           1.46s                9              420
Trivia                       MiMo-V2-Pro            3.0    10.0         0.0%               0                           82.71s               28             15,842
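
As a consistency check on the category breakdown, the per-category flaky-test, output-token, and reasoning-token counts can be summed and compared with the totals reported in the summary section. The sketch below is illustrative only: the variable names and the `totals` helper are assumptions introduced here, and each tuple holds (flaky tests, output tokens, reasoning tokens) copied from the rows above in category order.

```python
# (flaky tests, output tokens, reasoning tokens) per category, in the order
# listed in the breakdown: Anti-AI Tricks, Coding, Combined, Data parsing and
# extraction, Domain specific, General Intelligence, Instructions following,
# Puzzle Solving, Tool Calling, Trivia.
gemini_rows = [
    (2, 1013, 1548), (0, 441, 408), (0, 348, 975), (0, 291, 697),
    (0, 15, 1214), (0, 69, 438), (0, 72, 760), (0, 210, 1191),
    (0, 234, 945), (0, 9, 420),
]
mimo_rows = [
    (0, 251, 1154), (0, 485, 11361), (1, 380, 14186), (1, 260, 7484),
    (0, 170, 2158), (0, 184, 400), (0, 83, 667), (2, 410, 1700),
    (0, 263, 864), (0, 28, 15842),
]


def totals(rows):
    """Sum each column across all categories."""
    flaky = sum(row[0] for row in rows)
    output_tokens = sum(row[1] for row in rows)
    reasoning_tokens = sum(row[2] for row in rows)
    return flaky, output_tokens, reasoning_tokens


gemini_totals = totals(gemini_rows)
mimo_totals = totals(mimo_rows)

# Totals reported in the summary: flaky tests, output tokens, reasoning tokens.
print(gemini_totals == (2, 2702, 8596))
print(mimo_totals == (4, 2514, 55816))
```

Both comparisons hold, so the category rows add up to the summary totals for flaky tests, output tokens, and reasoning tokens.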
