Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Qwen: Qwen3.6 27B vs Xiaomi: MiMo-V2-Omni

Last updated at: 2026-05-22

Metric Qwen3.6 27B Qwen3.6 27B medium Release: 2026-04-20 MiMo-V2-Omni MiMo-V2-Omni medium Release: 2026-03-18
Score 6.6 6.9
Rank #83 #72
Reliability 9.9 10.0
Consistency 8.1 8.7
Tests Correct
Attempt pass rate 58.3% 58.3%
Flaky tests 5 3
Total Runs 60 52
Cost per result 3.015 7.334
Total Cost $0.272 $0.734
Input Price $0.317 / 1M $0.400 / 1M
Output Price $3.200 / 1M $2.000 / 1M
Output Tokens 13,007 1,952
Reasoning Tokens 105,697 357,306
Response Time (avg) 57.65s 41.16s
Response Time (max) 168.22s 299.23s
Response Time (total) 1153.04s 823.26s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 27B 8.3 10.0 75.0% 0 12.62s 582 4,311
MiMo-V2-Omni 10.0 10.0 100.0% 0 2.75s 269 1,701
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 27B 6.6 10.0 50.0% 0 165.39s 4,760 26,668
MiMo-V2-Omni 3.4 4.8 16.7% 1 183.89s 292 174,314
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 27B 7.0 3.7 66.7% 1 83.07s 2,088 14,689
MiMo-V2-Omni 10.0 10.0 100.0% 0 25.87s 380 8,673
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 27B 3.5 1.4 50.0% 2 37.30s 568 9,404
MiMo-V2-Omni 10.0 10.0 100.0% 0 3.04s 155 591
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 27B 2.9 7.2 11.1% 1 73.38s 3,510 20,352
MiMo-V2-Omni 3.0 10.0 0.0% 0 47.89s 155 68,398
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 27B 6.5 3.4 66.7% 1 39.53s 81 3,045
MiMo-V2-Omni 5.4 2.5 66.7% 1 3.61s 136 492
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 27B 10.0 10.0 100.0% 0 37.96s 346 6,548
MiMo-V2-Omni 8.3 10.0 50.0% 0 4.99s 49 515
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 27B 7.7 10.0 66.7% 0 60.21s 281 11,919
MiMo-V2-Omni 5.9 7.2 55.6% 1 2.38s 210 860
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 27B 10.0 10.0 100.0% 0 16.88s 390 2,954
MiMo-V2-Omni 10.0 10.0 100.0% 0 13.98s 303 3,461
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3.6 27B 3.0 10.0 0.0% 0 80.99s 401 5,807
MiMo-V2-Omni 3.0 10.0 0.0% 0 234.19s 3 98,301

Quick Compare

Switch Comparison Pair