AI BENCHY Compare

Google: Gemini 3.1 Flash Lite vs Xiaomi: MiMo-V2.5-Pro

Summary

Gemini 3.1 Flash Lite vs MiMo-V2.5-Pro benchmark comparison: Gemini 3.1 Flash Lite leads on average score (6.4 vs 5.5), has the lower total benchmark cost ($0.013 vs $0.017), and responds faster on average (1.33s vs 1.78s). Its attempt pass rate is also higher: 54.0% vs 39.7%.

Recommended model: Gemini 3.1 Flash Lite - It has the higher score in this comparison (6.4) and the better overall balance of cost and response time of the two models.

Last updated at: 2026-06-04

| Metric | Gemini 3.1 Flash Lite (minimal) | MiMo-V2.5-Pro (none) |
|---|---|---|
| Release | 2026-05-08 | 2026-04-22 |
| Score | 6.4 | 5.5 |
| Rank | #87 | #123 |
| Reliability | 10.0 | 10.0 |
| Consistency | 8.8 | 8.6 |
| Tests Correct | | |
| Attempt pass rate | 54.0% | 39.7% |
| Flaky tests | 3 | 4 |
| Total Runs | 63 | 63 |
| Cost per result | 0.130 | 0.648 |
| Total Cost | $0.013 | $0.017 |
| Input Price | $0.250 / 1M | $0.435 / 1M |
| Output Price | $1.500 / 1M | $0.870 / 1M |
| Total Input Tokens | 36,973 | 30,724 |
| Output Tokens | 2,487 | 3,043 |
| Reasoning Tokens | 0 | 0 |
| Response Time (avg) | 1.33s | 1.78s |
| Response Time (max) | 4.49s | 8.32s |
| Response Time (total) | 27.91s | 37.42s |
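
The Total Cost row can be roughly cross-checked against the token counts and per-million prices listed above. The sketch below assumes cost is simply input tokens times input price plus output tokens times output price; the page does not state its actual formula, and the `Usage` class and `estimated_cost` function are hypothetical names used only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Usage:
    """Token counts and per-1M-token prices copied from the metrics above."""
    input_tokens: int
    output_tokens: int
    input_price_per_m: float   # USD per 1M input tokens
    output_price_per_m: float  # USD per 1M output tokens


def estimated_cost(u: Usage) -> float:
    """Assumed formula: tokens * price-per-million, summed over input and output."""
    total = (u.input_tokens * u.input_price_per_m
             + u.output_tokens * u.output_price_per_m)
    return total / 1_000_000


gemini = Usage(36_973, 2_487, 0.250, 1.500)
mimo = Usage(30_724, 3_043, 0.435, 0.870)

gemini_cost = estimated_cost(gemini)
mimo_cost = estimated_cost(mimo)

print(f"Gemini 3.1 Flash Lite: ${gemini_cost:.4f}")
print(f"MiMo-V2.5-Pro:         ${mimo_cost:.4f}")
```

Under this assumption the Gemini 3.1 Flash Lite estimate rounds to the reported $0.013, while the MiMo-V2.5-Pro estimate comes out near $0.016, slightly below the reported $0.017. The reported total may therefore include something this simple formula does not capture.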

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

| Model | Rank | Cost | Time | Tokens |
|---|---|---|---|---|
| Gemini 3.1 Flash Lite (minimal) | #87 | $0.001 | 3.7s | 635 |
| MiMo-V2.5-Pro (none) | #123 | $0.004 | 46.4s | 4,025 |
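
The showcase figures can be turned into a rough generation rate for each model. This is a minimal sketch that assumes the token counts shown are tokens produced during the listed time, which the page does not confirm; `tokens_per_second` is a hypothetical helper.

```python
# (tokens, seconds) for the hamster SVG prompt, copied from the showcase above.
showcase = {
    "Gemini 3.1 Flash Lite": (635, 3.7),
    "MiMo-V2.5-Pro": (4025, 46.4),
}


def tokens_per_second(tokens: int, seconds: float) -> float:
    """Average tokens generated per second of wall-clock time."""
    return tokens / seconds


rates = {name: tokens_per_second(t, s) for name, (t, s) in showcase.items()}

for name, rate in rates.items():
    print(f"{name}: {rate:.1f} tok/s")
```

On these numbers Gemini 3.1 Flash Lite works out to roughly twice the rate of MiMo-V2.5-Pro for this single prompt. One generation is too small a sample to generalize from.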

Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens.

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Input Tokens | Output Tokens | Reasoning Tokens |
|---|---|---|---|---|---|---|---|---|---|---|
| Anti-AI Tricks | Gemini 3.1 Flash Lite | 8.3 | 10.0 | 75.0% | 0 | | 1.10s | 500 | 639 | 0 |
| Anti-AI Tricks | MiMo-V2.5-Pro | 3.3 | 8.1 | 8.3% | 1 | | 2.67s | 645 | 994 | 0 |
| Coding | Gemini 3.1 Flash Lite | 5.5 | 10.0 | 33.3% | 0 | | 831ms | 8,126 | 666 | 0 |
| Coding | MiMo-V2.5-Pro | 4.3 | 7.8 | 22.2% | 1 | | 1.41s | 6,559 | 485 | 0 |
| Combined | Gemini 3.1 Flash Lite | 3.0 | 10.0 | 0.0% | 0 | | 2.53s | 12,870 | 357 | 0 |
| Combined | MiMo-V2.5-Pro | 3.0 | 10.0 | 0.0% | 0 | | 3.54s | 4,695 | 596 | 0 |
| Data parsing and extraction | Gemini 3.1 Flash Lite | 10.0 | 10.0 | 100.0% | 0 | | 1.04s | 7,552 | 279 | 0 |
| Data parsing and extraction | MiMo-V2.5-Pro | 10.0 | 10.0 | 100.0% | 0 | | 1.32s | 7,758 | 249 | 0 |
| Domain specific | Gemini 3.1 Flash Lite | 2.9 | 7.2 | 11.1% | 1 | | 1.02s | 641 | 15 | 0 |
| Domain specific | MiMo-V2.5-Pro | 5.3 | 10.0 | 33.3% | 0 | | 877ms | 753 | 27 | 0 |
| General Intelligence | Gemini 3.1 Flash Lite | 4.0 | 10.0 | 0.0% | 0 | | 791ms | 490 | 63 | 0 |
| General Intelligence | MiMo-V2.5-Pro | 4.0 | 10.0 | 0.0% | 0 | | 2.58s | 498 | 87 | 0 |
| Instructions following | Gemini 3.1 Flash Lite | 10.0 | 10.0 | 100.0% | 0 | | 932ms | 615 | 72 | 0 |
| Instructions following | MiMo-V2.5-Pro | 6.4 | 10.0 | 50.0% | 0 | | 1.03s | 684 | 66 | 0 |
| Puzzle Solving | Gemini 3.1 Flash Lite | 6.0 | 4.6 | 66.7% | 2 | | 2.15s | 564 | 153 | 0 |
| Puzzle Solving | MiMo-V2.5-Pro | 6.7 | 4.7 | 77.8% | 2 | | 1.30s | 678 | 267 | 0 |
| Tool Calling | Gemini 3.1 Flash Lite | 10.0 | 10.0 | 100.0% | 0 | | 3.51s | 5,457 | 234 | 0 |
| Tool Calling | MiMo-V2.5-Pro | 10.0 | 10.0 | 100.0% | 0 | | 3.30s | 8,238 | 258 | 0 |
| Trivia | Gemini 3.1 Flash Lite | 3.0 | 10.0 | 0.0% | 0 | | 724ms | 158 | 9 | 0 |
| Trivia | MiMo-V2.5-Pro | 3.0 | 10.0 | 0.0% | 0 | | 1.89s | 216 | 14 | 0 |
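
To see where the overall gap comes from, the category scores listed above can be tallied by which model leads each category. The sketch below is illustrative only: the `tally_leaders` helper is a hypothetical name, and it compares the published one-decimal scores exactly, so a tie means the displayed values are equal.

```python
from collections import Counter

# Category -> (Gemini 3.1 Flash Lite score, MiMo-V2.5-Pro score), from the breakdown above.
category_scores = {
    "Anti-AI Tricks": (8.3, 3.3),
    "Coding": (5.5, 4.3),
    "Combined": (3.0, 3.0),
    "Data parsing and extraction": (10.0, 10.0),
    "Domain specific": (2.9, 5.3),
    "General Intelligence": (4.0, 4.0),
    "Instructions following": (10.0, 6.4),
    "Puzzle Solving": (6.0, 6.7),
    "Tool Calling": (10.0, 10.0),
    "Trivia": (3.0, 3.0),
}


def tally_leaders(scores: dict[str, tuple[float, float]]) -> Counter:
    """Count categories led by each model, plus ties."""
    tally = Counter()
    for gemini_score, mimo_score in scores.values():
        if gemini_score > mimo_score:
            tally["Gemini 3.1 Flash Lite"] += 1
        elif mimo_score > gemini_score:
            tally["MiMo-V2.5-Pro"] += 1
        else:
            tally["tie"] += 1
    return tally


leaders = tally_leaders(category_scores)
print(dict(leaders))
```

On the published scores, Gemini 3.1 Flash Lite leads three categories, MiMo-V2.5-Pro leads two, and five are tied. The largest single gap is in Anti-AI Tricks (8.3 vs 3.3).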
