Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

MoonshotAI: Kimi K2.5 vs OpenAI: GPT-5 Nano

Last updated at: 2026-05-19

Metric Kimi K2.5 Kimi K2.5 medium Release: 2026-01-27 GPT-5 Nano GPT-5 Nano medium Release: 2025-08-07
Score 6.8 6.2
Rank #76 #90
Reliability 10.0 10.0
Consistency 7.0 7.0
Tests Correct
Attempt pass rate 68.4% 57.9%
Flaky tests 7 7
Total Runs 57 57
Cost per result 2.616 0.856
Total Cost $0.236 $0.069
Input Price $0.400 / 1M $0.050 / 1M
Output Price $1.900 / 1M $0.400 / 1M
Output Tokens 42,188 5,214
Reasoning Tokens 92,514 162,432
Response Time (avg) 73.39s 42.13s
Response Time (max) 150.77s 204.02s
Response Time (total) 880.65s 505.59s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Kimi K2.5 7.3 5.8 83.3% 2 51.38s 2,789 8,880
GPT-5 Nano 6.5 7.9 58.3% 1 25.50s 1,221 21,184
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Kimi K2.5 4.7 1.6 66.7% 1 150.77s 1,269 9,749
GPT-5 Nano 6.7 3.5 66.7% 1 40.73s 480 12,992
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Kimi K2.5 10.0 10.0 100.0% 0 71.37s 703 3,713
GPT-5 Nano 10.0 10.0 100.0% 0 65.96s 578 17,984
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Kimi K2.5 10.0 10.0 100.0% 0 49.78s 563 7,940
GPT-5 Nano 3.7 1.7 50.0% 2 21.42s 453 10,560
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Kimi K2.5 3.5 4.4 33.3% 2 137.29s 20,753 30,564
GPT-5 Nano 5.2 4.4 55.6% 2 204.02s 237 64,448
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Kimi K2.5 6.5 3.4 66.7% 1 69.73s 3,815 4,262
GPT-5 Nano 4.1 10.0 0.0% 0 17.51s 202 4,608
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Kimi K2.5 10.0 10.0 100.0% 0 92.47s 5,371 6,547
GPT-5 Nano 9.8 10.0 100.0% 0 11.90s 382 4,096
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Kimi K2.5 5.3 7.3 44.4% 1 45.40s 6,671 12,403
GPT-5 Nano 5.3 7.2 44.4% 1 19.81s 869 13,440
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Kimi K2.5 10.0 10.0 100.0% 0 31.74s 242 812
GPT-5 Nano 10.0 10.0 100.0% 0 33.30s 558 6,976
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Kimi K2.5 3.0 10.0 0.0% 0 83.95s 12 7,644
GPT-5 Nano 3.0 10.0 0.0% 0 20.13s 234 6,144

Quick Compare

Switch Comparison Pair