Navigate
AI BENCHY
Your ad here

AI BENCHY Compare

OpenAI: GPT-5.5 vs Xiaomi: MiMo-V2-Flash

Last updated at: 2026-04-24

Metric GPT-5.5 GPT-5.5 none Release: 2026-04-24 MiMo-V2-Flash MiMo-V2-Flash medium Release: 2025-12-16
Score 6.8 7.5
Rank #58 #49
Reliability N/A N/A
Consistency 8.3 8.6
Tests Correct
Attempt pass rate 61.1% 70.4%
Flaky tests 4 3
Total Runs 54 54
Cost per result 2.162 0.341
Total Cost $0.195 $0.038
Input Price $5.000 / 1M $0.090 / 1M
Output Price $30.000 / 1M $0.290 / 1M
Output Tokens 1,910 12,387
Reasoning Tokens 0 115,182
Response Time (avg) 1.83s 23.36s
Response Time (max) 5.56s 96.01s
Response Time (total) 32.86s 280.34s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.5 6.9 7.9 66.7% 1 1.31s 213 0
MiMo-V2-Flash 8.1 7.9 83.3% 1 15.85s 1,674 23,559
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.5 10.0 10.0 100.0% 0 2.05s 426 0
MiMo-V2-Flash 4.7 1.6 66.7% 1 13.03s 428 3,648
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.5 3.0 10.0 0.0% 0 5.56s 300 0
MiMo-V2-Flash 9.8 10.0 100.0% 0 75.68s 442 26,859
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.5 10.0 10.0 100.0% 0 1.18s 222 0
MiMo-V2-Flash 6.5 10.0 50.0% 0 0ms 153 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.5 2.9 7.2 11.1% 1 1.31s 52 0
MiMo-V2-Flash 5.9 7.2 55.6% 1 96.01s 8,374 42,461
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.5 10.0 10.0 100.0% 0 3.41s 124 0
MiMo-V2-Flash 4.0 10.0 0.0% 0 4.20s 87 488
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.5 6.2 5.8 66.7% 1 1.15s 81 0
MiMo-V2-Flash 10.0 10.0 100.0% 0 4.28s 75 3,504
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.5 6.7 7.9 55.6% 1 1.36s 245 0
MiMo-V2-Flash 7.7 10.0 66.7% 0 3.77s 833 1,948
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.5 10.0 10.0 100.0% 0 3.90s 247 0
MiMo-V2-Flash 10.0 10.0 100.0% 0 27.78s 321 12,715

Quick Compare

Switch Comparison Pair