AI BENCHY Compare

HY3 Preview vs Z.ai: GLM 5

Last updated at: 2026-04-23

| Metric | HY3 Preview (medium; Release: 2026-04-22; Free Available) | GLM 5 (none; Release: 2026-02-12) |
| --- | --- | --- |
| Score | 8.1 | 6.6 |
| Rank | #27 | #58 |
| Consistency | 9.6 | 9.6 |
| Tests Correct | | |
| Attempt pass rate | 74.1% | 51.9% |
| Flaky tests | 1 | 1 |
| Total Runs | 54 | 54 |
| Cost per result | 0.000 | 0.217 |
| Total Cost | $0.000 | $0.020 |
| Input Price | $0.000 / 1M | $0.650 / 1M |
| Output Price | $0.000 / 1M | $2.080 / 1M |
| Output Tokens | 65,057 | 1,959 |
| Reasoning Tokens | 0 | 0 |
| Response Time (avg) | 14.63s | 4.23s |
| Response Time (max) | 46.04s | 11.07s |
| Response Time (total) | 248.72s | 46.51s |
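
The summary figures above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the page does not state: that "Attempt pass rate" is passed attempts divided by "Total Runs", and that the output share of cost is "Output Tokens" multiplied by "Output Price". The function names are hypothetical.

```python
# Figures copied from the summary metrics above.
TOTAL_RUNS = 54
PASS_RATE = {"HY3 Preview": 0.741, "GLM 5": 0.519}
OUTPUT_TOKENS = {"HY3 Preview": 65_057, "GLM 5": 1_959}
OUTPUT_PRICE_PER_1M = {"HY3 Preview": 0.000, "GLM 5": 2.080}  # USD per 1M tokens


def passed_attempts(rate: float, runs: int) -> int:
    """Estimate passed attempts, ASSUMING rate = passed / runs."""
    return round(rate * runs)


def output_cost(tokens: int, price_per_1m: float) -> float:
    """Estimate the output-token share of cost in USD (ASSUMED formula)."""
    return tokens * price_per_1m / 1_000_000


def summarize() -> dict:
    """Build the derived figures for both models."""
    return {
        model: {
            "passed_attempts": passed_attempts(PASS_RATE[model], TOTAL_RUNS),
            "output_cost_usd": output_cost(
                OUTPUT_TOKENS[model], OUTPUT_PRICE_PER_1M[model]
            ),
        }
        for model in PASS_RATE
    }


if __name__ == "__main__":
    for model, derived in summarize().items():
        print(model, derived)
```

Under these assumptions the pass rates correspond to roughly 40 and 28 passed attempts out of 54, and the GLM 5 output tokens account for only part of its listed total cost; the remainder would presumably come from input tokens, which the page does not report.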

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | HY3 Preview | 10.0 | 10.0 | 100.0% | 0 | | 6.59s | 5,955 | 0 |
| Anti-AI Tricks | GLM 5 | 4.8 | 10.0 | 25.0% | 0 | | 2.37s | 275 | 0 |
| Coding | HY3 Preview | 10.0 | 10.0 | 100.0% | 0 | | 31.37s | 8,054 | 0 |
| Coding | GLM 5 | 5.6 | 3.5 | 33.3% | 1 | | 8.84s | 408 | 0 |
| Combined | HY3 Preview | 10.0 | 10.0 | 100.0% | 0 | | 46.04s | 12,018 | 0 |
| Combined | GLM 5 | 3.0 | 10.0 | 0.0% | 0 | | 4.98s | 406 | 0 |
| Data parsing and extraction | HY3 Preview | 6.5 | 10.0 | 50.0% | 0 | | 5.25s | 930 | 0 |
| Data parsing and extraction | GLM 5 | 10.0 | 10.0 | 100.0% | 0 | | 5.78s | 203 | 0 |
| Domain specific | HY3 Preview | 5.3 | 10.0 | 33.3% | 0 | | 22.30s | 22,527 | 0 |
| Domain specific | GLM 5 | 3.0 | 10.0 | 0.0% | 0 | | 2.24s | 19 | 0 |
| General Intelligence | HY3 Preview | 10.0 | 10.0 | 100.0% | 0 | | 16.84s | 2,448 | 0 |
| General Intelligence | GLM 5 | 10.0 | 10.0 | 100.0% | 0 | | 3.27s | 103 | 0 |
| Instructions following | HY3 Preview | 10.0 | 10.0 | 100.0% | 0 | | 6.16s | 2,967 | 0 |
| Instructions following | GLM 5 | 10.0 | 10.0 | 100.0% | 0 | | 1.48s | 61 | 0 |
| Puzzle Solving | HY3 Preview | 5.3 | 7.4 | 44.4% | 1 | | 9.55s | 7,062 | 0 |
| Puzzle Solving | GLM 5 | 7.7 | 10.0 | 66.7% | 0 | | 2.05s | 264 | 0 |
| Tool Calling | HY3 Preview | 10.0 | 10.0 | 100.0% | 0 | | 15.02s | 3,096 | 0 |
| Tool Calling | GLM 5 | 10.0 | 10.0 | 100.0% | 0 | | 11.07s | 220 | 0 |
