Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

MoonshotAI: Kimi K2.6 vs Z.ai: GLM 5V Turbo

Last updated at: 2026-06-04

Metric Kimi K2.6 Kimi K2.6 none Release: 2026-04-20 Free Available GLM 5V Turbo GLM 5V Turbo none Release: 2026-04-01
Score 5.5 5.8
Rank #124 #109
Reliability 10.0 10.0
Consistency 9.2 10.0
Tests Correct
Attempt pass rate 36.5% 38.1%
Flaky tests 2 0
Total Runs 63 63
Cost per result 1.267 0.645
Total Cost $0.079 $0.052
Input Price $0.684 / 1M $1.200 / 1M
Output Price $3.420 / 1M $4.000 / 1M
Total Input Tokens 32,916 37,100
Output Tokens 16,410 1,766
Reasoning Tokens 0 0
Response Time (avg) 13.27s 2.99s
Response Time (max) 238.89s 6.51s
Response Time (total) 278.57s 62.74s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Kimi K2.6 4.6 10.0 25.0% 0 1.39s 618 471 0
GLM 5V Turbo 4.8 10.0 25.0% 0 3.13s 555 281 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Kimi K2.6 5.5 9.8 33.3% 0 82.57s 5,986 14,754 0
GLM 5V Turbo 5.5 10.0 33.3% 0 3.13s 7,256 360 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Kimi K2.6 3.0 10.0 0.0% 0 3.38s 11,269 290 0
GLM 5V Turbo 3.0 10.0 0.0% 0 6.51s 12,708 276 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Kimi K2.6 10.0 10.0 100.0% 0 1.32s 7,014 201 0
GLM 5V Turbo 10.0 10.0 100.0% 0 3.81s 7,107 204 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Kimi K2.6 5.3 7.2 44.4% 1 1.48s 732 42 0
GLM 5V Turbo 5.3 10.0 33.3% 0 2.09s 687 24 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Kimi K2.6 5.4 3.5 33.3% 1 1.55s 477 138 0
GLM 5V Turbo 4.6 10.0 0.0% 0 2.22s 477 114 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Kimi K2.6 6.5 10.0 50.0% 0 1.64s 669 72 0
GLM 5V Turbo 6.5 10.0 50.0% 0 1.97s 636 60 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Kimi K2.6 3.1 9.9 0.0% 0 1.40s 651 185 0
GLM 5V Turbo 5.3 10.0 33.3% 0 2.40s 609 210 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Kimi K2.6 10.0 10.0 100.0% 0 4.46s 5,286 240 0
GLM 5V Turbo 10.0 10.0 100.0% 0 4.86s 6,879 222 0
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Kimi K2.6 3.0 10.0 0.0% 0 1.36s 214 17 0
GLM 5V Turbo 3.0 10.0 0.0% 0 2.23s 186 15 0

Quick Compare

Switch Comparison Pair