AI BENCHY Compare

MiniMax: MiniMax M2.5 vs MoonshotAI: Kimi K2.5

Last updated at: 2026-05-29

MiniMax M2.5 (medium). Release: 2026-02-12. Free available.
Kimi K2.5 (none). Release: 2026-01-27.

| Metric | MiniMax M2.5 | Kimi K2.5 |
| --- | --- | --- |
| Score | 5.5 | 5.3 |
| Rank | #123 | #133 |
| Reliability | 10.0 | 10.0 |
| Consistency | 6.4 | 8.9 |
| Tests Correct | | |
| Attempt pass rate | 48.3% | 36.7% |
| Flaky tests | 9 | 3 |
| Total Runs | 60 | 60 |
| Cost per result | 6.075 | 0.425 |
| Total Cost | $0.304 | $0.026 |
| Input Price | $0.150 / 1M | $0.400 / 1M |
| Output Price | $1.150 / 1M | $1.900 / 1M |
| Output Tokens | 109,492 | 6,653 |
| Reasoning Tokens | 251,674 | 0 |
| Response Time (avg) | 49.87s | 14.06s |
| Response Time (max) | 237.27s | 42.13s |
| Response Time (total) | 598.39s | 182.72s |
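
The headline figures above can be combined into a rough value-for-money comparison. The sketch below is illustrative only: `score_per_dollar` and `ratio` are hypothetical helpers, not metrics reported by the benchmark, and the numbers are copied by hand from the summary table.

```python
# Headline figures copied from the summary table above.
MODELS = {
    "MiniMax M2.5": {"score": 5.5, "total_cost_usd": 0.304, "avg_response_s": 49.87},
    "Kimi K2.5": {"score": 5.3, "total_cost_usd": 0.026, "avg_response_s": 14.06},
}


def score_per_dollar(stats: dict) -> float:
    """Hypothetical derived metric: benchmark score divided by total run cost."""
    return stats["score"] / stats["total_cost_usd"]


def ratio(metric: str, numerator: str, denominator: str) -> float:
    """How many times larger `metric` is for one model than for the other."""
    return MODELS[numerator][metric] / MODELS[denominator][metric]


for name, stats in MODELS.items():
    print(f"{name}: {score_per_dollar(stats):.1f} score points per USD")

print(f"Cost ratio: {ratio('total_cost_usd', 'MiniMax M2.5', 'Kimi K2.5'):.1f}x")
print(f"Avg response time ratio: {ratio('avg_response_s', 'MiniMax M2.5', 'Kimi K2.5'):.1f}x")
```

On these figures the two models score within 0.2 points of each other, while the MiniMax M2.5 run cost about 11.7 times as much in total and took about 3.5 times as long per response on average.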

Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens.

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | MiniMax M2.5 | 7.9 | 6.3 | 83.3% | 2 | | 20.82s | 286 | 45,344 |
| Anti-AI Tricks | Kimi K2.5 | 3.6 | 8.4 | 8.3% | 1 | | 6.24s | 373 | 0 |
| Coding | MiniMax M2.5 | 3.5 | 9.8 | 0.0% | 0 | | 125.80s | 354 | 27,037 |
| Coding | Kimi K2.5 | 6.8 | 10.0 | 50.0% | 0 | | 35.97s | 4,704 | 0 |
| Combined | MiniMax M2.5 | 4.5 | 2.1 | 66.7% | 1 | | 60.39s | 740 | 9,713 |
| Combined | Kimi K2.5 | 2.8 | 2.1 | 33.3% | 1 | | 19.16s | 748 | 0 |
| Data parsing and extraction | MiniMax M2.5 | 4.6 | 1.7 | 66.7% | 2 | | 7.48s | 266 | 3,835 |
| Data parsing and extraction | Kimi K2.5 | 7.3 | 5.8 | 83.3% | 1 | | 42.13s | 187 | 0 |
| Domain specific | MiniMax M2.5 | 2.9 | 4.4 | 22.2% | 2 | | 237.27s | 105,047 | 133,487 |
| Domain specific | Kimi K2.5 | 5.3 | 10.0 | 33.3% | 0 | | 4.38s | 29 | 0 |
| General Intelligence | MiniMax M2.5 | 3.8 | 2.5 | 33.3% | 1 | | 6.63s | 25 | 1,686 |
| General Intelligence | Kimi K2.5 | 10.0 | 10.0 | 100.0% | 0 | | 4.00s | 76 | 0 |
| Instructions following | MiniMax M2.5 | 7.5 | 10.0 | 50.0% | 0 | | 621ms | 156 | 1,495 |
| Instructions following | Kimi K2.5 | 6.5 | 10.0 | 50.0% | 0 | | 2.67s | 60 | 0 |
| Puzzle Solving | MiniMax M2.5 | 5.3 | 7.2 | 44.4% | 1 | | 11.21s | 1,069 | 9,605 |
| Puzzle Solving | Kimi K2.5 | 3.0 | 10.0 | 0.0% | 0 | | 4.04s | 236 | 0 |
| Tool Calling | MiniMax M2.5 | 10.0 | 10.0 | 100.0% | 0 | | 15.35s | 269 | 937 |
| Tool Calling | Kimi K2.5 | 10.0 | 10.0 | 100.0% | 0 | | 13.99s | 220 | 0 |
| Trivia | MiniMax M2.5 | 3.0 | 10.0 | 0.0% | 0 | | 80.79s | 1,280 | 18,535 |
| Trivia | Kimi K2.5 | 3.0 | 10.0 | 0.0% | 0 | | 3.90s | 20 | 0 |
