AI BENCHY Compare

Qwen: Qwen3.6 Max Preview vs Z.ai: GLM 5 Turbo

Last updated at: 2026-05-08

| Metric | Qwen3.6 Max Preview (medium) | GLM 5 Turbo (medium) |
| --- | --- | --- |
| Release | 2026-04-20 | 2026-03-15 |
| Score | 8.5 | 8.1 |
| Rank | #9 | #20 |
| Reliability | 10.0 | 6.7 |
| Consistency | 9.6 | 8.4 |
| Tests Correct | | |
| Attempt pass rate | 80.7% | 77.2% |
| Flaky tests | 1 | 4 |
| Total Runs | 57 | 57 |
| Cost per result | 5.808 | 1.438 |
| Total Cost | $0.872 | $0.187 |
| Input Price | $1.040 / 1M | $1.200 / 1M |
| Output Price | $6.240 / 1M | $4.000 / 1M |
| Output Tokens | 2,186 | 12,217 |
| Reasoning Tokens | 105,156 | 40,252 |
| Response Time (avg) | 48.96s | 18.85s |
| Response Time (max) | 186.74s | 194.23s |
| Response Time (total) | 930.20s | 358.15s |

Charts (not reproduced here): Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 22.13s | 228 | 10,075 |
| Anti-AI Tricks | GLM 5 Turbo | 10.0 | 10.0 | 100.0% | 0 | | 4.82s | 362 | 3,137 |
| Coding | Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 117.87s | 368 | 13,790 |
| Coding | GLM 5 Turbo | 10.0 | 10.0 | 100.0% | 0 | | 12.26s | 332 | 3,301 |
| Combined | Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 121.49s | 390 | 14,575 |
| Combined | GLM 5 Turbo | 10.0 | 10.0 | 100.0% | 0 | | 13.88s | 390 | 2,037 |
| Data parsing and extraction | Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 41.15s | 270 | 10,106 |
| Data parsing and extraction | GLM 5 Turbo | 10.0 | 10.0 | 100.0% | 0 | | 6.19s | 577 | 3,632 |
| Domain specific | Qwen3.6 Max Preview | 2.9 | 7.2 | 11.1% | 1 | | 95.91s | 60 | 30,371 |
| Domain specific | GLM 5 Turbo | 2.9 | 4.4 | 22.2% | 2 | | 71.07s | 9,665 | 19,279 |
| General Intelligence | Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 32.24s | 129 | 3,510 |
| General Intelligence | GLM 5 Turbo | 6.1 | 3.1 | 66.7% | 1 | | 10.05s | 60 | 2,216 |
| Instructions following | Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 24.31s | 103 | 5,848 |
| Instructions following | GLM 5 Turbo | 10.0 | 10.0 | 100.0% | 0 | | 5.38s | 255 | 2,183 |
| Puzzle Solving | Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 24.19s | 301 | 7,649 |
| Puzzle Solving | GLM 5 Turbo | 8.7 | 7.9 | 77.8% | 1 | | 5.44s | 315 | 2,702 |
| Tool Calling | Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 18.32s | 309 | 1,571 |
| Tool Calling | GLM 5 Turbo | 10.0 | 10.0 | 100.0% | 0 | | 9.84s | 241 | 446 |
| Trivia | Qwen3.6 Max Preview | 3.0 | 10.0 | 0.0% | 0 | | 60.56s | 28 | 7,661 |
| Trivia | GLM 5 Turbo | 3.0 | 10.0 | 0.0% | 0 | | 40.17s | 20 | 1,319 |
