
AI BENCHY Compare

Elephant vs Z.ai: GLM 5 Turbo

Last updated at: 2026-04-14

| Metric | Elephant (none, Release: 2026-04-14) | GLM 5 Turbo (none, Release: 2026-03-15) |
| --- | --- | --- |
| Score | 5.2 | 5.5 |
| Rank | #81 | #73 |
| Consistency | 9.6 | 9.2 |
| Tests Correct | | |
| Attempt pass rate | 31.5% | 37.0% |
| Flaky tests | 1 | 2 |
| Total Runs | 54 | 54 |
| Cost per result | 0.000 | 0.518 |
| Total Cost | $0.000 | $0.032 |
| Input Price | $0.000 / 1M | $1.200 / 1M |
| Output Price | $0.000 / 1M | $4.000 / 1M |
| Output Tokens | 2,573 | 1,775 |
| Reasoning Tokens | 0 | 0 |
| Response Time (avg) | 1.23s | 2.94s |
| Response Time (max) | 3.81s | 8.21s |
| Response Time (total) | 22.16s | 52.98s |
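
The summary figures above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the page does not say how its metrics are computed, so it assumes that the attempt pass rate is passed attempts divided by Total Runs, and that the average response time is the total time divided by some number of timed units. The helper names `implied_passes` and `implied_timed_units` are hypothetical.

```python
# Figures copied from the summary table. How the site derives them is an
# ASSUMPTION made here for illustration, not something the page states.
TOTAL_RUNS = 54

MODELS = {
    "Elephant": {"pass_rate_pct": 31.5, "avg_s": 1.23, "total_s": 22.16},
    "GLM 5 Turbo": {"pass_rate_pct": 37.0, "avg_s": 2.94, "total_s": 52.98},
}


def implied_passes(pass_rate_pct: float, total_runs: int) -> int:
    """Passed attempts implied by a pass rate, assuming rate = passes / runs."""
    return round(pass_rate_pct / 100 * total_runs)


def implied_timed_units(total_s: float, avg_s: float) -> int:
    """Number of timed units implied by total and average response time."""
    return round(total_s / avg_s)


for name, m in MODELS.items():
    passes = implied_passes(m["pass_rate_pct"], TOTAL_RUNS)
    units = implied_timed_units(m["total_s"], m["avg_s"])
    # Recompute the rate from the implied count to confirm it rounds back.
    recomputed = round(100 * passes / TOTAL_RUNS, 1)
    print(f"{name}: ~{passes}/{TOTAL_RUNS} attempts passed "
          f"({recomputed}%), average taken over ~{units} units")
```

Under these assumptions the published pass rates correspond to whole numbers of passed attempts out of 54, and for both models the total and average response times imply the same number of timed units, which is smaller than Total Runs. That would be consistent with the average being taken per test rather than per run, but the page does not confirm this.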

[Charts not reproduced: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | Elephant | 6.6 | 10.0 | 50.0% | 0 | | 963ms | 610 | 0 |
| Anti-AI Tricks | GLM 5 Turbo | 3.0 | 10.0 | 0.0% | 0 | | 2.84s | 382 | 0 |
| Coding | Elephant | 6.4 | 3.3 | 66.7% | 1 | | 1.39s | 375 | 0 |
| Coding | GLM 5 Turbo | 5.3 | 3.4 | 33.3% | 1 | | 3.93s | 505 | 0 |
| Combined | Elephant | 3.0 | 10.0 | 0.0% | 0 | | 3.81s | 731 | 0 |
| Combined | GLM 5 Turbo | 3.0 | 10.0 | 0.0% | 0 | | 4.89s | 144 | 0 |
| Data parsing and extraction | Elephant | 6.5 | 10.0 | 50.0% | 0 | | 1.04s | 246 | 0 |
| Data parsing and extraction | GLM 5 Turbo | 10.0 | 10.0 | 100.0% | 0 | | 2.47s | 204 | 0 |
| Domain specific | Elephant | 3.0 | 10.0 | 0.0% | 0 | | 927ms | 24 | 0 |
| Domain specific | GLM 5 Turbo | 5.3 | 10.0 | 33.3% | 0 | | 1.97s | 25 | 0 |
| General Intelligence | Elephant | 4.0 | 10.0 | 0.0% | 0 | | 854ms | 106 | 0 |
| General Intelligence | GLM 5 Turbo | 4.2 | 9.9 | 0.0% | 0 | | 2.18s | 48 | 0 |
| Instructions following | Elephant | 9.8 | 10.0 | 100.0% | 0 | | 1.03s | 81 | 0 |
| Instructions following | GLM 5 Turbo | 6.5 | 10.0 | 50.0% | 0 | | 2.13s | 65 | 0 |
| Puzzle Solving | Elephant | 3.3 | 10.0 | 0.0% | 0 | | 849ms | 170 | 0 |
| Puzzle Solving | GLM 5 Turbo | 5.5 | 7.4 | 44.4% | 1 | | 2.43s | 180 | 0 |
| Tool Calling | Elephant | 3.0 | 10.0 | 0.0% | 0 | | 2.79s | 230 | 0 |
| Tool Calling | GLM 5 Turbo | 10.0 | 10.0 | 100.0% | 0 | | 8.21s | 222 | 0 |