AI BENCHY Compare

Google: Gemini 3.1 Flash Lite vs Xiaomi: MiMo-V2.5

Summary

Gemini 3.1 Flash Lite vs MiMo-V2.5 benchmark comparison: MiMo-V2.5 leads on average score (6.7 vs 6.1) and on attempt pass rate (69.8% vs 52.4%). Gemini 3.1 Flash Lite has the lower total benchmark cost ($0.013 vs $0.063) and the faster average response time (1.06s vs 27.11s).

Recommended model: Gemini 3.1 Flash Lite. Its score stays close to the best score here (6.1 vs 6.7), while MiMo-V2.5 costs about 4.8x as much to run ($0.063 vs $0.013).
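The trade-off behind this recommendation can be checked with a few lines of arithmetic. The sketch below only uses the headline figures quoted in the summary; the dictionary keys and variable names are illustrative and not part of the benchmark's own tooling.

```python
# Headline figures copied from the summary above.
gemini = {"score": 6.1, "total_cost_usd": 0.013, "avg_response_s": 1.06}
mimo = {"score": 6.7, "total_cost_usd": 0.063, "avg_response_s": 27.11}

# How many times more MiMo-V2.5 costs and how many times slower it responds.
cost_ratio = mimo["total_cost_usd"] / gemini["total_cost_usd"]
speed_ratio = mimo["avg_response_s"] / gemini["avg_response_s"]

# Score gap, in absolute points and relative to the higher score.
score_gap = mimo["score"] - gemini["score"]
score_gap_pct = 100 * score_gap / mimo["score"]

print(f"Cost ratio:  {cost_ratio:.1f}x")   # 4.8x
print(f"Speed ratio: {speed_ratio:.1f}x")  # 25.6x
print(f"Score gap:   {score_gap:.1f} points ({score_gap_pct:.1f}% of the higher score)")
```

On these numbers, Gemini 3.1 Flash Lite gives up about 0.6 points of score in exchange for a roughly 4.8x lower cost and a roughly 25.6x faster average response.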

Last updated at: 2026-06-18

| Metric | Gemini 3.1 Flash Lite (none; Release: 2026-05-08) | MiMo-V2.5 (medium; Release: 2026-04-22) |
| --- | --- | --- |
| Score | 6.1 | 6.7 |
| Rank | #96 | #76 |
| Reliability | 10.0 | 10.0 |
| Consistency | 8.6 | 8.1 |
| Tests Correct | | |
| Attempt pass rate | 52.4% | 69.8% |
| Flaky tests | 4 | 5 |
| Total Runs | 63 | 63 |
| Cost per result | 0.144 | 2.966 |
| Total Cost | $0.013 | $0.063 |
| Input Price | $0.250 / 1M | $0.140 / 1M |
| Output Price | $1.500 / 1M | $0.280 / 1M |
| Total Input Tokens | 36,710 | 41,838 |
| Output Tokens | 2,484 | 2,827 |
| Reasoning Tokens | 0 | 198,898 |
| Response Time (avg) | 1.06s | 27.11s |
| Response Time (max) | 2.97s | 162.44s |
| Response Time (total) | 22.35s | 569.38s |
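The Total Cost row can be roughly reconstructed from the token counts and per-million-token prices listed above. The sketch below assumes that reasoning tokens are billed at the output price and that no caching or other discounts apply; neither assumption is stated on the page, and `estimate_cost_usd` is a hypothetical helper, not part of any benchmark API.

```python
def estimate_cost_usd(input_tokens, output_tokens, reasoning_tokens,
                      input_price_per_m, output_price_per_m):
    """Estimate total cost in USD from token counts and per-1M-token prices.

    Assumption: reasoning tokens are billed at the output price.
    """
    billed_output = output_tokens + reasoning_tokens
    total = input_tokens * input_price_per_m + billed_output * output_price_per_m
    return total / 1_000_000

# Token counts and prices copied from the metrics table above.
gemini_cost = estimate_cost_usd(36_710, 2_484, 0, 0.250, 1.500)
mimo_cost = estimate_cost_usd(41_838, 2_827, 198_898, 0.140, 0.280)

print(f"Gemini 3.1 Flash Lite: ${gemini_cost:.4f}")  # $0.0129
print(f"MiMo-V2.5:             ${mimo_cost:.4f}")    # $0.0623
```

Under these assumptions both estimates land within about $0.001 of the reported totals ($0.013 and $0.063). Although MiMo-V2.5 has the lower per-token prices, its 198,898 reasoning tokens account for most of its estimated cost.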

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#96 Gemini 3.1 Flash Lite (none): Cost $0.001, Time 4.5s, Tokens 727

#76 MiMo-V2.5 (medium): Cost $0.002, Time 54.8s, Tokens 5,247

[Charts not shown: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Input Tokens | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | Gemini 3.1 Flash Lite | 7.5 | 8.4 | 66.7% | 1 | | 1.07s | 506 | 639 | 0 |
| Anti-AI Tricks | MiMo-V2.5 | 10.0 | 10.0 | 100.0% | 0 | | 4.14s | 621 | 281 | 1,739 |
| Coding | Gemini 3.1 Flash Lite | 5.5 | 10.0 | 33.3% | 0 | | 938ms | 8,128 | 666 | 0 |
| Coding | MiMo-V2.5 | 6.2 | 4.7 | 66.7% | 2 | | 97.14s | 7,422 | 557 | 81,977 |
| Combined | Gemini 3.1 Flash Lite | 3.0 | 10.0 | 0.0% | 0 | | 2.73s | 12,870 | 357 | 0 |
| Combined | MiMo-V2.5 | 10.0 | 10.0 | 100.0% | 0 | | 16.86s | 15,060 | 363 | 7,609 |
| Data parsing and extraction | Gemini 3.1 Flash Lite | 10.0 | 10.0 | 100.0% | 0 | | 843ms | 7,267 | 279 | 0 |
| Data parsing and extraction | MiMo-V2.5 | 2.7 | 5.7 | 16.7% | 1 | | 6.33s | 7,746 | 306 | 5,714 |
| Domain specific | Gemini 3.1 Flash Lite | 2.9 | 7.2 | 11.1% | 1 | | 762ms | 647 | 15 | 0 |
| Domain specific | MiMo-V2.5 | 5.3 | 10.0 | 33.3% | 0 | | 34.53s | 735 | 507 | 49,478 |
| General Intelligence | Gemini 3.1 Flash Lite | 4.0 | 10.0 | 0.0% | 0 | | 992ms | 486 | 63 | 0 |
| General Intelligence | MiMo-V2.5 | 5.4 | 2.5 | 66.7% | 1 | | 5.37s | 492 | 121 | 418 |
| Instructions following | Gemini 3.1 Flash Lite | 10.0 | 10.0 | 100.0% | 0 | | 859ms | 619 | 72 | 0 |
| Instructions following | MiMo-V2.5 | 9.9 | 10.0 | 100.0% | 0 | | 1.80s | 672 | 88 | 801 |
| Puzzle Solving | Gemini 3.1 Flash Lite | 6.3 | 4.8 | 66.7% | 2 | | 720ms | 570 | 150 | 0 |
| Puzzle Solving | MiMo-V2.5 | 8.2 | 7.2 | 88.9% | 1 | | 20.25s | 660 | 279 | 33,254 |
| Tool Calling | Gemini 3.1 Flash Lite | 10.0 | 10.0 | 100.0% | 0 | | 2.97s | 5,457 | 234 | 0 |
| Tool Calling | MiMo-V2.5 | 10.0 | 10.0 | 100.0% | 0 | | 7.29s | 8,220 | 303 | 2,424 |
| Trivia | Gemini 3.1 Flash Lite | 3.0 | 10.0 | 0.0% | 0 | | 733ms | 160 | 9 | 0 |
| Trivia | MiMo-V2.5 | 3.0 | 10.0 | 0.0% | 0 | | 51.29s | 210 | 22 | 15,484 |
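The per-category scores above can be tallied to see where each model leads. The sketch below is illustrative only: it compares the Score column category by category and counts wins and ties. It does not reproduce how the site computes its overall score, and the variable names are hypothetical.

```python
from collections import Counter

# (Gemini 3.1 Flash Lite score, MiMo-V2.5 score) per category, from the table above.
category_scores = {
    "Anti-AI Tricks": (7.5, 10.0),
    "Coding": (5.5, 6.2),
    "Combined": (3.0, 10.0),
    "Data parsing and extraction": (10.0, 2.7),
    "Domain specific": (2.9, 5.3),
    "General Intelligence": (4.0, 5.4),
    "Instructions following": (10.0, 9.9),
    "Puzzle Solving": (6.3, 8.2),
    "Tool Calling": (10.0, 10.0),
    "Trivia": (3.0, 3.0),
}

tally = Counter()
for category, (gemini_score, mimo_score) in category_scores.items():
    if gemini_score > mimo_score:
        tally["Gemini 3.1 Flash Lite"] += 1
    elif mimo_score > gemini_score:
        tally["MiMo-V2.5"] += 1
    else:
        tally["tie"] += 1

for name, wins in tally.most_common():
    print(f"{name}: {wins}")
```

By this count MiMo-V2.5 leads in six categories, Gemini 3.1 Flash Lite in two (Data parsing and extraction, Instructions following), and two are tied (Tool Calling, Trivia).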
