AI BENCHY Compare

OpenAI: GPT-5 Mini vs Z.ai: GLM 5

Summary

GPT-5 Mini vs GLM 5 benchmark comparison: The average score is effectively tied at 8.5 vs 8.6. GPT-5 Mini has the lower benchmark cost at $0.159 vs $0.228 and the faster average response time at 23.64s vs 33.54s. GLM 5 has the higher attempt pass rate at 82.5% vs 63.5%.

Recommended model: GPT-5 Mini - Its score is effectively tied with GLM 5 (8.5 vs 8.6), and it offers the better overall balance of cost and response time of the two models.
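To put the summary figures on a common footing, the gaps can be expressed as relative differences. The following is a minimal sketch in Python. The dictionary keys and the `pct_more` helper are illustrative names, not part of the benchmark. The numbers are copied from the summary above.

```python
# Figures copied from the summary; key names are illustrative.
gpt5_mini = {"score": 8.5, "cost_usd": 0.159, "avg_time_s": 23.64, "pass_rate_pct": 63.5}
glm5 = {"score": 8.6, "cost_usd": 0.228, "avg_time_s": 33.54, "pass_rate_pct": 82.5}


def pct_more(base: float, other: float) -> float:
    """Return by how many percent `other` exceeds `base`."""
    return (other - base) / base * 100


# How much more GLM 5 costs and how much longer it takes, relative to GPT-5 Mini.
cost_gap = pct_more(gpt5_mini["cost_usd"], glm5["cost_usd"])
time_gap = pct_more(gpt5_mini["avg_time_s"], glm5["avg_time_s"])

# Pass rates are already percentages, so the gap is in percentage points.
pass_gap = glm5["pass_rate_pct"] - gpt5_mini["pass_rate_pct"]

print(f"GLM 5 total cost: {cost_gap:.1f}% higher")
print(f"GLM 5 average response time: {time_gap:.1f}% longer")
print(f"GLM 5 attempt pass rate: {pass_gap:.1f} percentage points higher")
```

With these inputs, GLM 5 comes out roughly 43% more expensive and 42% slower on average, while passing 19 percentage points more attempts.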

Last updated at: 2026-07-02

| Metric | GPT-5 Mini (medium) | GLM 5 (medium) |
| --- | --- | --- |
| Release | 2025-08-07 | 2026-02-12 |
| Score | 8.5 | 8.6 |
| Rank | #16 | #15 |
| Reliability | 10.0 | 10.0 |
| Consistency | 9.1 | 8.5 |
| Attempt pass rate | 63.5% | 82.5% |
| Flaky tests | 2 | 4 |
| Total Runs | 63 | 63 |
| Cost per result | 1.319 | 1.668 |
| Total Cost | $0.159 | $0.228 |
| Input Price | $0.250 / 1M | $0.600 / 1M |
| Output Price | $2.000 / 1M | $1.920 / 1M |
| Total Input Tokens | 37,100 | 35,224 |
| Output Tokens | 6,801 | 21,570 |
| Reasoning Tokens | 67,690 | 102,996 |
| Response Time (avg) | 23.64s | 33.54s |
| Response Time (max) | 88.15s | 99.85s |
| Response Time (total) | 496.44s | 435.99s |

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#16 GPT-5 Mini (medium): Cost $0.007, Time 42.9s, Tokens 3,432

#15 GLM 5 (medium): Cost $0.005, Time 20.7s, Tokens 2,068

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Response Time (avg) | Input Tokens | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | GPT-5 Mini | 7.1 | 7.6 | 66.7% | 1 | 13.86s | 606 | 1,715 | 6,378 |
| Anti-AI Tricks | GLM 5 | 10.0 | 10.0 | 100.0% | 0 | 23.66s | 555 | 480 | 7,056 |
| Coding | GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | 27.63s | 7,302 | 658 | 17,152 |
| Coding | GLM 5 | 10.0 | 10.0 | 100.0% | 0 | 74.30s | 7,254 | 2,997 | 52,930 |
| Combined | GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | 88.15s | 14,118 | 754 | 11,520 |
| Combined | GLM 5 | 10.0 | 10.0 | 100.0% | 0 | 28.96s | 12,804 | 662 | 3,242 |
| Data parsing and extraction | GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | 12.58s | 7,140 | 453 | 3,200 |
| Data parsing and extraction | GLM 5 | 7.1 | 5.6 | 83.3% | 1 | 8.90s | 5,508 | 567 | 3,734 |
| Domain specific | GPT-5 Mini | 3.6 | 7.2 | 22.2% | 1 | 44.63s | 515 | 293 | 14,016 |
| Domain specific | GLM 5 | 3.5 | 4.4 | 33.3% | 2 | 0ms | 260 | 13,176 | 14,137 |
| General Intelligence | GPT-5 Mini | 4.5 | 10.0 | 0.0% | 0 | 13.50s | 477 | 349 | 1,856 |
| General Intelligence | GLM 5 | 6.1 | 3.1 | 66.7% | 1 | 14.69s | 477 | 2,020 | 2,248 |
| Instructions following | GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | 11.59s | 660 | 310 | 3,968 |
| Instructions following | GLM 5 | 10.0 | 10.0 | 100.0% | 0 | 7.25s | 636 | 1,001 | 2,129 |
| Puzzle Solving | GPT-5 Mini | 5.6 | 9.8 | 33.3% | 0 | 15.20s | 642 | 1,622 | 6,144 |
| Puzzle Solving | GLM 5 | 10.0 | 10.0 | 100.0% | 0 | 11.33s | 609 | 33 | 4,076 |
| Tool Calling | GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | 18.64s | 5,445 | 487 | 1,600 |
| Tool Calling | GLM 5 | 10.0 | 10.0 | 100.0% | 0 | 15.93s | 6,935 | 233 | 994 |
| Trivia | GPT-5 Mini | 3.0 | 10.0 | 0.0% | 0 | 9.99s | 195 | 160 | 1,856 |
| Trivia | GLM 5 | 3.0 | 10.0 | 0.0% | 0 | 67.37s | 186 | 401 | 12,450 |
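One way to read the category breakdown is to tally which model has the higher score in each category, and to check that the per-category reasoning tokens add up to the reported totals. The sketch below assumes nothing beyond the figures listed above. The variable names are illustrative, and the tally is an unweighted count, not the site's own scoring method.

```python
# Per-category scores copied from the breakdown: (GPT-5 Mini, GLM 5).
scores = {
    "Anti-AI Tricks": (7.1, 10.0),
    "Coding": (10.0, 10.0),
    "Combined": (10.0, 10.0),
    "Data parsing and extraction": (10.0, 7.1),
    "Domain specific": (3.6, 3.5),
    "General Intelligence": (4.5, 6.1),
    "Instructions following": (10.0, 10.0),
    "Puzzle Solving": (5.6, 10.0),
    "Tool Calling": (10.0, 10.0),
    "Trivia": (3.0, 3.0),
}

# Per-category reasoning tokens, in the same category order: (GPT-5 Mini, GLM 5).
reasoning_tokens = [
    (6378, 7056), (17152, 52930), (11520, 3242), (3200, 3734), (14016, 14137),
    (1856, 2248), (3968, 2129), (6144, 4076), (1600, 994), (1856, 12450),
]

# Count categories won by each model; equal scores count as a tie.
tally = {"GPT-5 Mini": 0, "GLM 5": 0, "tie": 0}
for gpt_score, glm_score in scores.values():
    if gpt_score > glm_score:
        tally["GPT-5 Mini"] += 1
    elif glm_score > gpt_score:
        tally["GLM 5"] += 1
    else:
        tally["tie"] += 1

# Sum the per-category reasoning tokens for comparison with the overall table.
gpt_reasoning_total = sum(gpt for gpt, _ in reasoning_tokens)
glm_reasoning_total = sum(glm for _, glm in reasoning_tokens)

print(tally)
print(gpt_reasoning_total, glm_reasoning_total)
```

On these figures, GLM 5 leads in three categories, GPT-5 Mini in two, and five are tied. The reasoning-token sums match the reported totals of 67,690 and 102,996.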
