Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

ByteDance Seed: Seed-2.0-Lite vs Z.ai: GLM 5

Summary

Seed-2.0-Lite vs GLM 5 benchmark comparison: GLM 5 leads on average score with 8.6 vs 8.5. Seed-2.0-Lite has the lower benchmark cost at $0.175 vs $0.228. GLM 5 is faster at 33.54s vs 47.07s, with pass rates of 76.2% vs 82.5%.

Recommended model: GLM 5 - It has the strongest score in this comparison (8.6) and the best overall balance of cost and response time across all 2 models.

Last updated at: 2026-07-02

Metric Seed-2.0-Lite Seed-2.0-Lite medium Release: 2026-02-14 GLM 5 GLM 5 medium Release: 2026-02-12
Score 8.5 8.6
Rank #18 #15
Reliability 10.0 10.0
Consistency 9.0 8.5
Tests Correct
Attempt pass rate 76.2% 82.5%
Flaky tests 3 4
Total Runs 63 63
Cost per result 1.250 1.668
Total Cost $0.175 $0.228
Input Price $0.250 / 1M $0.600 / 1M
Output Price $2.000 / 1M $1.920 / 1M
Total Input Tokens 46,740 35,224
Output Tokens 3,230 21,570
Reasoning Tokens 78,406 102,996
Response Time (avg) 47.07s 33.54s
Response Time (max) 254.92s 99.85s
Response Time (total) 988.37s 435.99s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#18 Seed-2.0-Lite

medium
Cost
$0.005
Time
86.7s
Tokens
2,354 tok

#15 GLM 5

medium
Cost
$0.005
Time
20.7s
Tokens
2,068 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 8.3 10.0 75.0% 0 17.99s 942 996 7,142
GLM 5 10.0 10.0 100.0% 0 23.66s 555 480 7,056
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 8.0 9.8 66.7% 0 156.74s 8,247 458 31,890
GLM 5 10.0 10.0 100.0% 0 74.30s 7,254 2,997 52,930
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 37.67s 16,254 506 4,299
GLM 5 10.0 10.0 100.0% 0 28.96s 12,804 662 3,242
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 9.07s 8,562 246 1,742
GLM 5 7.1 5.6 83.3% 1 8.90s 5,508 567 3,734
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 5.9 7.2 55.6% 1 88.74s 843 15 23,897
GLM 5 3.5 4.4 33.3% 2 0ms 260 13,176 14,137
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 6.7 3.6 66.7% 1 18.25s 582 304 1,620
GLM 5 6.1 3.1 66.7% 1 14.69s 477 2,020 2,248
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 7.26s 834 71 1,480
GLM 5 10.0 10.0 100.0% 0 7.25s 636 1,001 2,129
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 9.0 7.9 88.9% 1 10.23s 894 403 3,285
GLM 5 10.0 10.0 100.0% 0 11.33s 609 33 4,076
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 12.38s 9,306 222 1,011
GLM 5 10.0 10.0 100.0% 0 15.93s 6,935 233 994
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 3.0 10.0 0.0% 0 48.32s 276 9 2,040
GLM 5 3.0 10.0 0.0% 0 67.37s 186 401 12,450

Quick Compare

Switch Comparison Pair