Navigate
AI BENCHY
Your ad here

AI BENCHY Compare

Qwen: Qwen3 Coder Next vs Z.ai: GLM 5 Turbo

Last updated at: 2026-03-15

Metric Qwen3 Coder Next Qwen3 Coder Next medium Release: 2026-02-03 GLM 5 Turbo GLM 5 Turbo none Release: 2026-03-15
Rank #63 #53
Score 4.9 5.7
Consistency 9.1 9.5
Cost per result 0.230 0.467
Total Cost $0.007 $0.028
Tests Correct
Attempt pass rate 27.1% 39.6%
Flaky tests 2 1
Total Runs 48 48
Output Tokens 2,935 1,264
Reasoning Tokens 0 0
Response Time (avg) 12.53s 2.92s
Response Time (max) 81.80s 8.21s
Response Time (total) 125.32s 46.72s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3 Coder Next 3.6 7.5 22.2% 1 15.28s 1,246 0
GLM 5 Turbo 3.0 10.0 0.0% 0 3.01s 376 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3 Coder Next 3.0 10.0 0.0% 0 4.28s 317 0
GLM 5 Turbo 3.0 10.0 0.0% 0 4.89s 144 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3 Coder Next 6.5 10.0 50.0% 0 81.80s 246 0
GLM 5 Turbo 10.0 10.0 100.0% 0 2.47s 204 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3 Coder Next 5.3 10.0 33.3% 0 638ms 25 0
GLM 5 Turbo 5.3 10.0 33.3% 0 1.97s 25 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3 Coder Next 6.3 3.4 66.7% 1 1.39s 142 0
GLM 5 Turbo 4.2 9.9 0.0% 0 2.18s 48 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3 Coder Next 4.8 10.0 0.0% 0 7.34s 63 0
GLM 5 Turbo 6.5 10.0 50.0% 0 2.13s 65 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3 Coder Next 3.1 10.0 0.0% 0 2.30s 641 0
GLM 5 Turbo 5.5 7.4 44.4% 1 2.43s 180 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Qwen3 Coder Next 10.0 10.0 100.0% 0 2.64s 255 0
GLM 5 Turbo 10.0 10.0 100.0% 0 8.21s 222 0

Quick Compare

Switch Comparison Pair