Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

DeepSeek: DeepSeek V4 Pro vs Z.ai: GLM 4.7 Flash

Last updated at: 2026-05-21

Metric DeepSeek V4 Pro DeepSeek V4 Pro none Release: 2026-04-24 GLM 4.7 Flash GLM 4.7 Flash none Release: 2026-01-19
Score 6.2 5.8
Rank #93 #106
Reliability 8.0 10.0
Consistency 8.8 8.7
Tests Correct
Attempt pass rate 50.9% 40.4%
Flaky tests 3 3
Total Runs 57 57
Cost per result 0.542 0.049
Total Cost $0.044 $0.003
Input Price $0.435 / 1M $0.060 / 1M
Output Price $0.870 / 1M $0.400 / 1M
Output Tokens 5,330 2,498
Reasoning Tokens 0 0
Response Time (avg) 14.09s 3.13s
Response Time (max) 58.65s 7.05s
Response Time (total) 267.72s 37.60s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 3.5 8.0 16.7% 1 14.02s 704 0
GLM 4.7 Flash 5.2 7.9 41.7% 1 5.51s 438 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 7.1 3.7 66.7% 1 14.69s 510 0
GLM 4.7 Flash 6.4 9.9 0.0% 0 5.57s 626 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 9.5 10.0 100.0% 0 25.49s 1,911 0
GLM 4.7 Flash 3.0 10.0 0.0% 0 3.22s 704 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 8.8 10.0 100.0% 0 30.54s 170 0
GLM 4.7 Flash 7.3 5.8 83.3% 1 4.82s 196 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 5.3 10.0 33.3% 0 3.17s 18 0
GLM 4.7 Flash 7.7 10.0 66.7% 0 744ms 19 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 4.3 9.9 0.0% 0 3.75s 132 0
GLM 4.7 Flash 4.0 10.0 0.0% 0 1.59s 134 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 6.3 10.0 50.0% 0 8.23s 64 0
GLM 4.7 Flash 6.5 10.0 50.0% 0 888ms 62 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 7.6 7.2 77.8% 1 19.72s 175 0
GLM 4.7 Flash 6.4 10.0 33.3% 0 1.00s 98 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 10.0 10.0 100.0% 0 5.92s 219 0
GLM 4.7 Flash 2.8 1.6 33.3% 1 7.05s 212 0
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 3.0 10.0 0.0% 0 15.59s 1,427 0
GLM 4.7 Flash 3.0 10.0 0.0% 0 692ms 9 0

Quick Compare

Switch Comparison Pair