Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

ByteDance Seed: Seed-2.0-Lite vs MiniMax: MiniMax M2.5

Last updated at: 2026-03-12

Metric Seed-2.0-Lite Seed-2.0-Lite none Release: 2026-02-14 MiniMax M2.5 MiniMax M2.5 medium Release: 2026-02-12
Rank #45 #49
Avg Score 4.9 4.7
Consistency 7.4 5.6
Cost per result 0.214 4.981
Total Cost $0.015 $0.250
Tests Correct
Attempt pass rate 56.3% 60.4%
Flaky tests 5 9
Total Runs 48 48
Output Tokens 2,743 107,044
Reasoning Tokens 0 206,190
Response Time (avg) 2.49s 43.03s
Response Time (max) 6.70s 237.27s
Response Time (total) 39.91s 387.25s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Avg Score vs Response Time (avg)

Total Output Tokens

Avg Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 4.6 22.2% 2 2.93s 703 0
MiniMax M2.5 9.3 7.9 88.9% 1 32.42s 286 45,112
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 0.0% 0 6.59s 498 0
MiniMax M2.5 10.0 2.1 66.7% 1 60.39s 740 9,713
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 9.9 10.0 100.0% 0 1.82s 246 0
MiniMax M2.5 10.0 1.7 66.7% 2 7.48s 266 3,835
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 7.2 22.2% 1 1.33s 17 0
MiniMax M2.5 10.0 4.4 22.2% 2 237.27s 105,047 133,487
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 3.45s 294 0
MiniMax M2.5 3.0 2.5 33.3% 1 6.63s 25 1,686
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 1.06s 73 0
MiniMax M2.5 8.0 6.8 83.3% 1 4.64s 252 1,873
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 4.0 4.4 55.6% 2 2.46s 620 0
MiniMax M2.5 4.0 7.2 44.4% 1 11.54s 159 9,547
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 3.94s 292 0
MiniMax M2.5 10.0 10.0 100.0% 0 15.35s 269 937

Quick Compare

Switch Comparison Pair