AI BENCHY
Linganisha
❤️ Made by XCS

Jina la modeli

MoonshotAI: Kimi K2.5

Benchmark zimetengenezwa kutoka seti za majaribio za Aibenchy tarehe : 19 Feb 2026

Kipimo MoonshotAI: Kimi K2.5
Nafasi#9
KampuniMoonshotAI
Score 6.42
Uthabiti 8.00
Gharama kwa matokeo 2.4097
Jumla ya gharama $0.16868
Majaribio sahihi 7/12
Kiwango cha kupita kwa kila jaribio 72.2%
Majaribio yasiyo thabiti 3
Tokeni za matokeo 30,235
Tokeni za hoja 53,179

Mgawanyo wa kategoria

Kategoria Majaribio yaliyopita kikamilifu Score Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Alama ya hoja Gharama
Anti-AI Tricks 2/2 10.00 10.00 100.0% 0 9.77 $0.00634
Data parsing and extraction 2/2 10.00 10.00 100.0% 0 9.67 $0.02325
Domain specific 0/3 1.00 4.41 33.3% 2 7.22 $0.09579
Instructions following 2/2 9.50 10.00 100.0% 0 9.42 $0.01428
Puzzle Solving 1/3 5.00 7.61 55.6% 1 9.26 $0.02904

Modeli zilizolinganishwa

Linganisha MoonshotAI: Kimi K2.5 dhidi ya...

#8 · X Ai

xAI: Grok 4.1 Fast

Uchambuzi (medium)

Score: 6.42

Uthabiti: 8.60

Kiwango cha kupita kwa kila jaribio: 66.7%

Majaribio yasiyo thabiti: 2

Gharama kwa matokeo: 0.4800

Majaribio sahihi: 7/12

Jumla ya gharama: $0.03360

Linganisha

#10 · Google

Google: Gemini 3 Flash Preview

Bila uchambuzi

Score: 6.25

Uthabiti: 8.60

Kiwango cha kupita kwa kila jaribio: 66.7%

Majaribio yasiyo thabiti: 2

Gharama kwa matokeo: 0.0754

Majaribio sahihi: 7/12

Jumla ya gharama: $0.00528

Linganisha

#7 · Z.ai

Z.ai: GLM 5

Uchambuzi (medium)

Score: 6.83

Uthabiti: 7.86

Kiwango cha kupita kwa kila jaribio: 80.6%

Majaribio yasiyo thabiti: 3

Gharama kwa matokeo: 1.3424

Majaribio sahihi: 8/12

Jumla ya gharama: $0.10740

Linganisha

Ulinganisho wa haraka

Linganisha MoonshotAI: Kimi K2.5 dhidi ya...