AI BENCHY Compare

Qwen: Qwen3.7 Plus vs Mimo V2 Omni

Last updated at: 2026-06-03

| Metric | Qwen3.7 Plus (none) | Mimo V2 Omni (medium) |
| --- | --- | --- |
| Release | 2026-06-03 | 2026-03-18 |
| Score | 6.6 | 6.9 |
| Rank | #82 | #75 |
| Reliability | 10.0 | 10.0 |
| Consistency | 10.0 | 8.7 |
| Tests Correct | | |
| Attempt pass rate | 50.0% | 58.3% |
| Flaky tests | 0 | 3 |
| Total Runs | 60 | 52 |
| Cost per result | 0.264 | 7.334 |
| Total Cost | $0.027 | $0.683 |
| Input Price | $0.400 / 1M | $1.722 / 1M |
| Output Price | $1.600 / 1M | $1.722 / 1M |
| Total Input Tokens | 39,669 | 37,007 |
| Output Tokens | 6,572 | 1,952 |
| Reasoning Tokens | 0 | 357,306 |
| Response Time (avg) | 2.95s | 41.16s |
| Response Time (max) | 29.38s | 299.23s |
| Response Time (total) | 58.96s | 823.26s |
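
The Total Cost figures can be roughly cross-checked from the listed prices and token counts. The sketch below is not the site's own formula; it assumes that prices are in USD per 1M tokens and that reasoning tokens are billed at the output price. The names `MODELS` and `estimate_cost` are illustrative only.

```python
# Prices (USD per 1M tokens) and token totals copied from the comparison table.
MODELS = {
    "Qwen3.7 Plus": {
        "input_price": 0.400,
        "output_price": 1.600,
        "input_tokens": 39_669,
        "output_tokens": 6_572,
        "reasoning_tokens": 0,
    },
    "Mimo V2 Omni": {
        "input_price": 1.722,
        "output_price": 1.722,
        "input_tokens": 37_007,
        "output_tokens": 1_952,
        "reasoning_tokens": 357_306,
    },
}


def estimate_cost(model: dict) -> float:
    """Estimate total cost in USD from token counts and per-1M-token prices."""
    # Assumption: reasoning tokens are charged like output tokens.
    billed_output = model["output_tokens"] + model["reasoning_tokens"]
    total = (
        model["input_tokens"] * model["input_price"]
        + billed_output * model["output_price"]
    )
    return total / 1_000_000


estimates = {name: estimate_cost(m) for name, m in MODELS.items()}
for name, cost in estimates.items():
    print(f"{name}: ${cost:.4f}")
```

Under these assumptions the estimates come out at about $0.0264 and $0.6824, close to the reported $0.027 and $0.683. Nearly all of the Mimo V2 Omni estimate comes from its reasoning tokens.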

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Input Tokens | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | Qwen3.7 Plus | 6.5 | 10.0 | 50.0% | 0 | | 1.38s | 696 | 349 | 0 |
| Anti-AI Tricks | Mimo V2 Omni | 10.0 | 10.0 | 100.0% | 0 | | 2.75s | 621 | 269 | 1,701 |
| Coding | Qwen3.7 Plus | 6.8 | 10.0 | 50.0% | 0 | | 2.77s | 5,070 | 633 | 0 |
| Coding | Mimo V2 Omni | 3.4 | 4.8 | 16.7% | 1 | | 183.89s | 4,787 | 292 | 174,314 |
| Combined | Qwen3.7 Plus | 10.0 | 10.0 | 100.0% | 0 | | 29.38s | 14,952 | 4,505 | 0 |
| Combined | Mimo V2 Omni | 10.0 | 10.0 | 100.0% | 0 | | 25.87s | 15,060 | 380 | 8,673 |
| Data parsing and extraction | Qwen3.7 Plus | 10.0 | 10.0 | 100.0% | 0 | | 1.43s | 7,794 | 243 | 0 |
| Data parsing and extraction | Mimo V2 Omni | 10.0 | 10.0 | 100.0% | 0 | | 3.04s | 6,002 | 155 | 591 |
| Domain specific | Qwen3.7 Plus | 3.0 | 10.0 | 0.0% | 0 | | 868ms | 789 | 18 | 0 |
| Domain specific | Mimo V2 Omni | 3.0 | 10.0 | 0.0% | 0 | | 47.89s | 735 | 155 | 68,398 |
| General Intelligence | Qwen3.7 Plus | 5.3 | 10.0 | 0.0% | 0 | | 1.33s | 522 | 78 | 0 |
| General Intelligence | Mimo V2 Omni | 5.4 | 2.5 | 66.7% | 1 | | 3.61s | 492 | 136 | 492 |
| Instructions following | Qwen3.7 Plus | 6.3 | 10.0 | 50.0% | 0 | | 929ms | 711 | 72 | 0 |
| Instructions following | Mimo V2 Omni | 8.3 | 10.0 | 50.0% | 0 | | 4.99s | 470 | 49 | 515 |
| Puzzle Solving | Qwen3.7 Plus | 7.7 | 10.0 | 66.7% | 0 | | 1.71s | 714 | 443 | 0 |
| Puzzle Solving | Mimo V2 Omni | 5.9 | 7.2 | 55.6% | 1 | | 2.38s | 410 | 210 | 860 |
| Tool Calling | Qwen3.7 Plus | 10.0 | 10.0 | 100.0% | 0 | | 3.54s | 8,211 | 222 | 0 |
| Tool Calling | Mimo V2 Omni | 10.0 | 10.0 | 100.0% | 0 | | 13.98s | 8,220 | 303 | 3,461 |
| Trivia | Qwen3.7 Plus | 3.0 | 10.0 | 0.0% | 0 | | 1.21s | 210 | 9 | 0 |
| Trivia | Mimo V2 Omni | 3.0 | 10.0 | 0.0% | 0 | | 234.19s | 210 | 3 | 98,301 |
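
The per-category token counts can be checked against the overall totals reported in the main comparison. The following is a minimal sketch using only numbers copied from this page; the names `CATEGORY_TOKENS`, `REPORTED_TOTALS`, and `column_sums` are illustrative and not part of any AI BENCHY tooling.

```python
# (input, output, reasoning) tokens per category, in the order listed above:
# Anti-AI Tricks, Coding, Combined, Data parsing and extraction,
# Domain specific, General Intelligence, Instructions following,
# Puzzle Solving, Tool Calling, Trivia.
CATEGORY_TOKENS = {
    "Qwen3.7 Plus": [
        (696, 349, 0), (5_070, 633, 0), (14_952, 4_505, 0), (7_794, 243, 0),
        (789, 18, 0), (522, 78, 0), (711, 72, 0), (714, 443, 0),
        (8_211, 222, 0), (210, 9, 0),
    ],
    "Mimo V2 Omni": [
        (621, 269, 1_701), (4_787, 292, 174_314), (15_060, 380, 8_673),
        (6_002, 155, 591), (735, 155, 68_398), (492, 136, 492),
        (470, 49, 515), (410, 210, 860), (8_220, 303, 3_461),
        (210, 3, 98_301),
    ],
}

# Overall (input, output, reasoning) totals from the main comparison.
REPORTED_TOTALS = {
    "Qwen3.7 Plus": (39_669, 6_572, 0),
    "Mimo V2 Omni": (37_007, 1_952, 357_306),
}


def column_sums(rows):
    """Sum each token column across all category rows."""
    return tuple(sum(column) for column in zip(*rows))


for model, rows in CATEGORY_TOKENS.items():
    sums = column_sums(rows)
    status = "matches" if sums == REPORTED_TOTALS[model] else "differs from"
    print(f"{model}: {sums} {status} the reported totals")
```

For both models the category sums match the reported totals exactly, so the breakdown and the summary describe the same set of runs.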
