
AI BENCHY Compare

xAI: Grok 4.1 Fast vs Z.ai: GLM 5 Turbo

Last updated at: 2026-03-15

| Metric | Grok 4.1 Fast (medium) | GLM 5 Turbo (none) |
|---|---|---|
| Release | 2025-11-19 | 2026-03-15 |
| Rank | #34 | #53 |
| Score | 7.1 | 5.7 |
| Consistency | 7.9 | 9.5 |
| Cost per result | 0.563 | 0.467 |
| Total Cost | $0.051 | $0.028 |
| Tests Correct | | |
| Attempt pass rate | 66.7% | 39.6% |
| Flaky tests | 4 | 1 |
| Total Runs | 48 | 48 |
| Output Tokens | 1,183 | 1,264 |
| Reasoning Tokens | 83,875 | 0 |
| Response Time (avg) | 26.35s | 2.92s |
| Response Time (max) | 121.79s | 8.21s |
| Response Time (total) | 237.11s | 46.72s |

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
|---|---|---|---|---|---|---|---|---|---|
| Anti-AI Tricks | Grok 4.1 Fast | 10.0 | 10.0 | 100.0% | 0 | | 5.65s | 102 | 4,021 |
| Anti-AI Tricks | GLM 5 Turbo | 3.0 | 10.0 | 0.0% | 0 | | 3.01s | 376 | 0 |
| Combined | Grok 4.1 Fast | 10.0 | 10.0 | 100.0% | 0 | | 37.64s | 261 | 12,272 |
| Combined | GLM 5 Turbo | 3.0 | 10.0 | 0.0% | 0 | | 4.89s | 144 | 0 |
| Data parsing and extraction | Grok 4.1 Fast | 10.0 | 10.0 | 100.0% | 0 | | 6.63s | 180 | 5,409 |
| Data parsing and extraction | GLM 5 Turbo | 10.0 | 10.0 | 100.0% | 0 | | 2.47s | 204 | 0 |
| Domain specific | Grok 4.1 Fast | 5.8 | 4.4 | 66.7% | 2 | | 121.79s | 11 | 37,657 |
| Domain specific | GLM 5 Turbo | 5.3 | 10.0 | 33.3% | 0 | | 1.97s | 25 | 0 |
| General Intelligence | Grok 4.1 Fast | 4.2 | 9.9 | 0.0% | 0 | | 16.25s | 127 | 3,456 |
| General Intelligence | GLM 5 Turbo | 4.2 | 9.9 | 0.0% | 0 | | 2.18s | 48 | 0 |
| Instructions following | Grok 4.1 Fast | 6.6 | 10.0 | 50.0% | 0 | | 5.30s | 55 | 3,489 |
| Instructions following | GLM 5 Turbo | 6.5 | 10.0 | 50.0% | 0 | | 2.13s | 65 | 0 |
| Puzzle Solving | Grok 4.1 Fast | 5.3 | 7.2 | 44.4% | 1 | | 8.08s | 187 | 6,086 |
| Puzzle Solving | GLM 5 Turbo | 5.5 | 7.4 | 44.4% | 1 | | 2.43s | 180 | 0 |
| Tool Calling | Grok 4.1 Fast | 2.8 | 1.6 | 33.3% | 1 | | 27.71s | 260 | 11,485 |
| Tool Calling | GLM 5 Turbo | 10.0 | 10.0 | 100.0% | 0 | | 8.21s | 222 | 0 |
