AI BENCHY

AI BENCHY Compare

Google: Gemini 3 Flash Preview vs Xiaomi: MiMo-V2.5-Pro

Summary

Gemini 3 Flash Preview vs MiMo-V2.5-Pro benchmark comparison: Gemini 3 Flash Preview leads on average score, 9.8 vs 5.5, and on attempt pass rate, 98.4% vs 39.7%. MiMo-V2.5-Pro has the lower benchmark cost, $0.017 vs $0.667, and the faster average response time, 1.78s vs 18.64s.

Recommended model: MiMo-V2.5-Pro. It offers the best overall trade-off: a score of 5.5 at a much lower total cost ($0.017 vs $0.667) and a shorter average response time (1.78s vs 18.64s) than Gemini 3 Flash Preview.
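The trade-off behind this recommendation can be made concrete with a little arithmetic. The sketch below uses only the headline figures quoted in the summary. The `ModelResult` container and the score-per-dollar measure are illustrative assumptions, not the formula AI Benchy uses to choose a recommended model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelResult:
    """Headline figures from the summary (hypothetical container)."""
    name: str
    score: float        # average score on the 0-10 scale
    total_cost: float   # USD for the whole benchmark run
    avg_seconds: float  # average response time in seconds


gemini = ModelResult("Gemini 3 Flash Preview", 9.8, 0.667, 18.64)
mimo = ModelResult("MiMo-V2.5-Pro", 5.5, 0.017, 1.78)


def score_per_dollar(m: ModelResult) -> float:
    # Illustrative measure only: score points per USD of benchmark cost.
    return m.score / m.total_cost


cost_ratio = gemini.total_cost / mimo.total_cost      # Gemini cost relative to MiMo
speed_ratio = gemini.avg_seconds / mimo.avg_seconds   # Gemini latency relative to MiMo
score_gap = gemini.score - mimo.score                 # score points MiMo gives up

print(f"cost ratio:  {cost_ratio:.1f}x")
print(f"speed ratio: {speed_ratio:.1f}x")
print(f"score gap:   {score_gap:.1f}")
for m in (gemini, mimo):
    print(f"{m.name}: {score_per_dollar(m):.1f} score points per USD")
```

On these figures MiMo-V2.5-Pro is roughly 39 times cheaper and about 10 times faster, while scoring 4.3 points lower. Whether that counts as the better trade-off depends on how much a failed attempt costs in a given workload.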

Last updated at: 2026-06-04

Metric                  Gemini 3 Flash Preview (medium)  MiMo-V2.5-Pro (none)
Release                 2025-12-17                       2026-04-22
Score                   9.8                              5.5
Rank                    #1                               #123
Reliability             10.0                             10.0
Consistency             9.7                              8.6
Tests Correct
Attempt pass rate       98.4%                            39.7%
Flaky tests             1                                4
Total Runs              63                               63
Cost per result         3.335                            0.648
Total Cost              $0.667                           $0.017
Input Price             $0.500 / 1M                      $0.435 / 1M
Output Price            $3.000 / 1M                      $0.870 / 1M
Total Input Tokens      37,017                           30,724
Output Tokens           2,006                            3,043
Reasoning Tokens        214,153                          0
Response Time (avg)     18.64s                           1.78s
Response Time (max)     117.26s                          8.32s
Response Time (total)   391.35s                          37.42s
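The Total Cost figures can be approximately reproduced from the token counts and the per-million prices listed with them. This is a sketch under one assumption the page does not state: reasoning tokens are billed at the output price. Under that assumption the Gemini figure is reproduced to three decimals, while the MiMo estimate comes out near $0.016 against the reported $0.017, so the site's exact accounting may differ. The `estimate_cost` helper is hypothetical.

```python
def estimate_cost(input_tokens, output_tokens, reasoning_tokens,
                  input_price_per_m, output_price_per_m):
    """Estimate USD cost. ASSUMES reasoning tokens bill at the output rate."""
    billed_output = output_tokens + reasoning_tokens
    return (input_tokens * input_price_per_m
            + billed_output * output_price_per_m) / 1_000_000


# Token counts and prices copied from the metrics listed above.
gemini_cost = estimate_cost(37_017, 2_006, 214_153, 0.500, 3.000)
mimo_cost = estimate_cost(30_724, 3_043, 0, 0.435, 0.870)

# Share of the Gemini estimate that comes from reasoning tokens alone.
gemini_reasoning_share = (214_153 * 3.000 / 1_000_000) / gemini_cost

print(f"Gemini 3 Flash Preview: ${gemini_cost:.3f} (reported $0.667)")
print(f"MiMo-V2.5-Pro:          ${mimo_cost:.3f} (reported $0.017)")
print(f"Reasoning share of Gemini cost: {gemini_reasoning_share:.0%}")
```

If the billing assumption holds, reasoning tokens make up roughly 96% of the Gemini estimate, which would explain most of the cost gap despite Gemini producing fewer visible output tokens.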

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#1 Gemini 3 Flash Preview (medium): Cost $0.010, Time 17.9s, Tokens 3,236

#123 MiMo-V2.5-Pro (none): Cost $0.004, Time 46.4s, Tokens 4,025

Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens

Category Breakdown

Model                   Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Input Tokens  Output Tokens  Reasoning Tokens

Anti-AI Tricks
Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           3.88s                494           330            3,216
MiMo-V2.5-Pro           3.3    8.1          8.3%               1                           2.67s                645           994            0

Coding
Gemini 3 Flash Preview  8.6    7.6          88.9%              1                           84.40s               8,122         462            161,084
MiMo-V2.5-Pro           4.3    7.8          22.2%              1                           1.41s                6,559         485            0

Combined
Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           22.42s               12,873        351            10,485
MiMo-V2.5-Pro           3.0    10.0         0.0%               0                           3.54s                4,695         596            0

Data parsing and extraction
Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           5.43s                7,548         279            4,893
MiMo-V2.5-Pro           10.0   10.0         100.0%             0                           1.32s                7,758         249            0

Domain specific
Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           15.27s               633           12             21,684
MiMo-V2.5-Pro           5.3    10.0         33.3%              0                           0.877s               753           27             0

General Intelligence
Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           5.19s                486           72             1,905
MiMo-V2.5-Pro           4.0    10.0         0.0%               0                           2.58s                498           87             0

Instructions following
Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           4.04s                615           72             2,709
MiMo-V2.5-Pro           6.4    10.0         50.0%              0                           1.03s                684           66             0

Puzzle Solving
Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           4.05s                558           183            4,365
MiMo-V2.5-Pro           6.7    4.7          77.8%              2                           1.30s                678           267            0

Tool Calling
Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           12.60s               5,532         234            1,487
MiMo-V2.5-Pro           10.0   10.0         100.0%             0                           3.30s                8,238         258            0

Trivia
Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           5.50s                156           11             2,325
MiMo-V2.5-Pro           3.0    10.0         0.0%               0                           1.89s                216           14             0
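The per-category token counts appear to be totals that add up to the overall Total Input Tokens, Output Tokens, and Reasoning Tokens reported earlier. A minimal check, with the numbers transcribed from the category rows and the list layout and `column_totals` helper chosen purely for illustration:

```python
# (input, output, reasoning) token counts per category, in this order:
# Anti-AI Tricks, Coding, Combined, Data parsing and extraction,
# Domain specific, General Intelligence, Instructions following,
# Puzzle Solving, Tool Calling, Trivia.
GEMINI = [
    (494, 330, 3_216), (8_122, 462, 161_084), (12_873, 351, 10_485),
    (7_548, 279, 4_893), (633, 12, 21_684), (486, 72, 1_905),
    (615, 72, 2_709), (558, 183, 4_365), (5_532, 234, 1_487),
    (156, 11, 2_325),
]
MIMO = [
    (645, 994, 0), (6_559, 485, 0), (4_695, 596, 0),
    (7_758, 249, 0), (753, 27, 0), (498, 87, 0),
    (684, 66, 0), (678, 267, 0), (8_238, 258, 0),
    (216, 14, 0),
]


def column_totals(rows):
    """Sum each of the three token columns across all categories."""
    return tuple(sum(col) for col in zip(*rows))


gemini_totals = column_totals(GEMINI)
mimo_totals = column_totals(MIMO)

# Fraction of Gemini's reasoning tokens spent in the Coding category (index 1).
coding_share = GEMINI[1][2] / gemini_totals[2]

print("Gemini (input, output, reasoning):", gemini_totals)
print("MiMo   (input, output, reasoning):", mimo_totals)
print(f"Coding share of Gemini reasoning tokens: {coding_share:.1%}")
```

The sums match the overall totals for both models, and Coding alone accounts for about three quarters of Gemini's reasoning tokens, which lines up with its 84.40s average response time in that category.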
