AI BENCHY
❤️ Made by XCS

Model Name

MoonshotAI: Kimi K2.5

No Reasoning

Last updated: Feb 24, 2026

| Metric | MoonshotAI: Kimi K2.5 |
| --- | --- |
| Rank | #24 |
| Company | MoonshotAI |
| Score | 3.62 |
| Consistency | 8.84 |
| Cost per result | 0.2413 |
| Total Cost | $0.00725 |
| Tests Correct | |
| Attempt pass rate | 30.8% |
| Flaky tests | 2 |
| Output Tokens | 1,695 |
| Reasoning Tokens | 0 |
| Response Time (avg) | 11378ms |
| Response Time (total) | 11378ms |
| Response Time (max) | 11378ms |

Category Breakdown

| Category | Fully passed tests | Score | Consistency | Attempt pass rate | Flaky tests | Reasoning score | Response Time (avg) | Cost |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | | 2.67 | 7.86 | 11.1% | 1 | - | 11378ms | $0.00121 |
| Data parsing and extraction | | 5.50 | 5.81 | 83.3% | 1 | - | 0ms | $0.00455 |
| Domain specific | | 4.00 | 10.00 | 33.3% | 0 | - | 0ms | $0.00027 |
| Instructions following | | 5.00 | 9.99 | 50.0% | 0 | - | 0ms | $0.00035 |
| Puzzle Solving | | 2.00 | 9.92 | 0.0% | 0 | - | 0ms | $0.00090 |

Compared models

Compare MoonshotAI: Kimi K2.5 against...

#23 · Z.ai

Z.ai: GLM 4.7 Flash

Reasoning (medium)

Score: 3.69

Consistency: 6.15

Attempt pass rate: 48.7%

Flaky tests: 6

Cost per result: 0.2600

Tests Correct:

Total Cost: $0.01041

#25 · xAI

xAI: Grok 4.1 Fast

No Reasoning

Score: 3.15

Consistency: 9.24

Attempt pass rate: 28.2%

Flaky tests: 1

Cost per result: 0.1153

Tests Correct:

Total Cost: $0.00346

#22 · Xiaomi

Xiaomi: MiMo-V2-Flash

Reasoning (medium)

Score: 3.77

Consistency: 7.46

Attempt pass rate: 43.6%

Flaky tests: 4

Cost per result: 0.5072

Tests Correct:

Total Cost: $0.02029
