AI BENCHY
Compare
❤️ Made by XCS

Model Name

MoonshotAI: Kimi K2.5

Last updated at : Feb 19, 2026

Metric MoonshotAI: Kimi K2.5
Rank#9
CompanyMoonshotAI
Score 6.42
Consistency 8.00
Cost per result 2.4097
Total Cost $0.16868
Tests Correct 7/12
Attempt pass rate 72.2%
Flaky tests 3
Output Tokens 30,235
Reasoning Tokens 53,179

Category Breakdown

Category Fully passed tests Score Consistency Attempt pass rate Flaky tests Reasoning score Cost
Anti-AI Tricks 2/2 10.00 10.00 100.0% 0 9.77 $0.00634
Data parsing and extraction 2/2 10.00 10.00 100.0% 0 9.67 $0.02325
Domain specific 0/3 1.00 4.41 33.3% 2 7.22 $0.09579
Instructions following 2/2 9.50 10.00 100.0% 0 9.42 $0.01428
Puzzle Solving 1/3 5.00 7.61 55.6% 1 9.26 $0.02904

Compared models

Compare MoonshotAI: Kimi K2.5 against...

#8 · X Ai

xAI: Grok 4.1 Fast

Reasoning (medium)

Score: 6.42

Consistency: 8.60

Attempt pass rate: 66.7%

Flaky tests: 2

Cost per result: 0.4800

Tests Correct: 7/12

Total Cost: $0.03360

Compare

#10 · Google

Google: Gemini 3 Flash Preview

No Reasoning

Score: 6.25

Consistency: 8.60

Attempt pass rate: 66.7%

Flaky tests: 2

Cost per result: 0.0754

Tests Correct: 7/12

Total Cost: $0.00528

Compare

#7 · Z.ai

Z.ai: GLM 5

Reasoning (medium)

Score: 6.83

Consistency: 7.86

Attempt pass rate: 80.6%

Flaky tests: 3

Cost per result: 1.3424

Tests Correct: 8/12

Total Cost: $0.10740

Compare

Quick Compare

Compare MoonshotAI: Kimi K2.5 against...