Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

OpenAI: GPT-5.2 vs Z.ai: GLM 5V Turbo

Last updated at: 2026-05-19

Metric GPT-5.2 GPT-5.2 medium Release: 2025-12-11 GLM 5V Turbo GLM 5V Turbo medium Release: 2026-04-01
Score 7.2 7.5
Rank #65 #54
Reliability 10.0 10.0
Consistency 8.2 7.6
Tests Correct
Attempt pass rate 68.4% 73.7%
Flaky tests 4 6
Total Runs 57 57
Cost per result 3.609 2.919
Total Cost $0.397 $0.322
Input Price $1.750 / 1M $1.200 / 1M
Output Price $14.000 / 1M $4.000 / 1M
Output Tokens 2,731 2,373
Reasoning Tokens 22,200 66,463
Response Time (avg) 15.22s 16.33s
Response Time (max) 77.80s 67.08s
Response Time (total) 182.59s 310.29s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.2 6.5 8.0 58.3% 1 7.81s 567 2,002
GLM 5V Turbo 7.2 6.1 75.0% 2 10.76s 587 7,872
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.2 10.0 10.0 100.0% 0 15.12s 467 2,166
GLM 5V Turbo 10.0 10.0 100.0% 0 13.78s 404 4,628
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.2 10.0 10.0 100.0% 0 14.06s 291 1,757
GLM 5V Turbo 6.9 3.8 66.7% 1 15.06s 403 2,523
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.2 10.0 10.0 100.0% 0 3.15s 234 420
GLM 5V Turbo 10.0 10.0 100.0% 0 9.60s 236 4,333
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.2 5.9 7.2 55.6% 1 77.80s 42 10,342
GLM 5V Turbo 5.3 7.2 44.4% 1 38.15s 32 29,035
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.2 3.7 9.7 0.0% 0 4.32s 162 269
GLM 5V Turbo 10.0 10.0 100.0% 0 11.09s 131 2,183
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.2 9.9 10.0 100.0% 0 3.12s 94 614
GLM 5V Turbo 9.9 10.0 100.0% 0 3.74s 72 1,813
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.2 7.6 7.3 77.8% 1 5.47s 609 938
GLM 5V Turbo 7.6 7.2 77.8% 1 10.91s 193 5,789
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.2 4.7 1.6 66.7% 1 10.30s 239 469
GLM 5V Turbo 7.0 3.7 66.7% 1 12.53s 293 765
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
GPT-5.2 3.0 10.0 0.0% 0 28.18s 26 3,223
GLM 5V Turbo 3.0 10.0 0.0% 0 40.96s 22 7,522

Quick Compare

Switch Comparison Pair