AI BENCHY Compare

Poolside: Laguna XS 2.1 vs Z.ai: GLM 5

Summary

Laguna XS 2.1 vs GLM 5 benchmark comparison: GLM 5 leads on average score, 6.0 vs 5.3, and on attempt pass rate, 44.4% vs 31.8%. Laguna XS 2.1 has the lower benchmark cost, $0.003 vs $0.027, and the faster average response time, 722ms vs 4.03s.

Recommended model: GLM 5 - It has the strongest score in this comparison (6.0). Laguna XS 2.1 remains both cheaper ($0.003 vs $0.027) and faster (722ms vs 4.03s), so the recommendation rests on score rather than on cost or response time.
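The trade-off in the summary can be made concrete by putting the headline figures side by side. The snippet below is a minimal sketch, not part of the benchmark itself: the dictionary keys and variable names are illustrative, and the numbers are copied from the summary.

```python
# Headline figures copied from the summary; key names are illustrative.
laguna = {"score": 5.3, "total_cost_usd": 0.003, "avg_latency_s": 0.722}
glm5 = {"score": 6.0, "total_cost_usd": 0.027, "avg_latency_s": 4.03}

# How much more GLM 5 costs and how much longer it takes, as multiples.
cost_ratio = glm5["total_cost_usd"] / laguna["total_cost_usd"]
latency_ratio = glm5["avg_latency_s"] / laguna["avg_latency_s"]

# How many score points that extra cost and latency buys.
score_gap = glm5["score"] - laguna["score"]

print(f"GLM 5 cost multiple:    {cost_ratio:.1f}x")
print(f"GLM 5 latency multiple: {latency_ratio:.1f}x")
print(f"GLM 5 score advantage:  {score_gap:.1f} points")
```

Whether a 0.7-point score advantage justifies the higher cost and latency depends on the workload; the sketch only restates the listed numbers as ratios.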

Last updated at: 2026-07-02

Metric                 Laguna XS 2.1 (none)         GLM 5 (none)
Release                2026-07-02 (Free Available)  2026-02-12
Score                  5.3                          6.0
Rank                   #128                         #104
Reliability            10.0                         10.0
Consistency            9.0                          9.7
Tests Correct
Attempt pass rate      31.8%                        44.4%
Flaky tests            3                            1
Total Runs             63                           63
Cost per result        0.058                        0.263
Total Cost             $0.003                       $0.027
Input Price            $0.060 / 1M                  $0.600 / 1M
Output Price           $0.120 / 1M                  $1.920 / 1M
Total Input Tokens     41,148                       37,135
Output Tokens          3,451                        1,989
Reasoning Tokens       0                            0
Response Time (avg)    722ms                        4.03s
Response Time (max)    2.30s                        11.07s
Response Time (total)  15.17s                       56.37s
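The total cost figures can be roughly cross-checked against the listed token counts and per-million-token prices. The sketch below assumes billing is simply linear in input and output tokens, which this page does not confirm; `estimate_cost_usd` is a hypothetical helper, not something the benchmark provides.

```python
def estimate_cost_usd(input_tokens, output_tokens,
                      input_price_per_m, output_price_per_m):
    """Estimate cost assuming purely linear per-token pricing (an assumption)."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000

# Token counts and prices copied from the metrics table.
laguna_cost = estimate_cost_usd(41_148, 3_451, 0.060, 0.120)
glm5_cost = estimate_cost_usd(37_135, 1_989, 0.600, 1.920)

print(f"Laguna XS 2.1 estimate: ${laguna_cost:.4f}")
print(f"GLM 5 estimate:         ${glm5_cost:.4f}")
```

Under this assumption the Laguna XS 2.1 estimate rounds to the listed $0.003, while the GLM 5 estimate comes out near $0.026, slightly below the listed $0.027. The page does not explain the gap, so the listed total may include rounding or charges that are not broken out here.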

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#128 Laguna XS 2.1 (none): Cost $0.001, Time 27.6s, Tokens 4,344

#104 GLM 5 (none): Cost $0.007, Time 32.1s, Tokens 2,023

Charts (not reproduced here): Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens.

Category Breakdown

Category                     Model          Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Input Tokens  Output Tokens  Reasoning Tokens
Anti-AI Tricks               Laguna XS 2.1  5.3    8.3          33.3%              1                           755ms                774           1,015          0
Anti-AI Tricks               GLM 5          4.8    10.0         25.0%              0                           2.37s                510           275            0
Coding                       Laguna XS 2.1  4.3    7.8          22.2%              1                           623ms                7,995         562            0
Coding                       GLM 5          4.0    7.8          11.1%              1                           5.12s                7,256         428            0
Combined                     Laguna XS 2.1  3.0    10.0         0.0%               0                           1.76s                14,197        402            0
Combined                     GLM 5          3.0    10.0         0.0%               0                           4.98s                12,812        406            0
Data parsing and extraction  Laguna XS 2.1  10.0   10.0         100.0%             0                           768ms                7,734         240            0
Data parsing and extraction  GLM 5          10.0   10.0         100.0%             0                           5.78s                7,107         203            0
Domain specific              Laguna XS 2.1  5.3    10.0         33.3%              0                           364ms                834           14             0
Domain specific              GLM 5          3.0    10.0         0.0%               0                           2.24s                643           19             0
General Intelligence         Laguna XS 2.1  5.0    10.0         0.0%               0                           529ms                537           128            0
General Intelligence         GLM 5          10.0   10.0         100.0%             0                           3.27s                477           103            0
Instructions following       Laguna XS 2.1  3.8    5.8          33.3%              1                           364ms                638           50             0
Instructions following       GLM 5          10.0   10.0         100.0%             0                           1.48s                636           61             0
Puzzle Solving               Laguna XS 2.1  3.0    10.0         0.0%               0                           1.01s                771           730            0
Puzzle Solving               GLM 5          7.7    10.0         66.7%              0                           1.91s                609           261            0
Tool Calling                 Laguna XS 2.1  10.0   10.0         100.0%             0                           1.36s                7,413         300            0
Tool Calling                 GLM 5          10.0   10.0         100.0%             0                           11.07s               6,899         220            0
Trivia                       Laguna XS 2.1  3.0    10.0         0.0%               0                           254ms                255           10             0
Trivia                       GLM 5          3.0    10.0         0.0%               0                           3.62s                186           13             0
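One way to read the category breakdown is to count which model scores higher in each category. The sketch below does only that: the scores are copied from the breakdown above, `tally_category_wins` is a hypothetical helper, and a simple win count is an illustrative choice rather than how the site computes its overall score.

```python
# (Laguna XS 2.1 score, GLM 5 score) per category, copied from the breakdown.
CATEGORY_SCORES = {
    "Anti-AI Tricks": (5.3, 4.8),
    "Coding": (4.3, 4.0),
    "Combined": (3.0, 3.0),
    "Data parsing and extraction": (10.0, 10.0),
    "Domain specific": (5.3, 3.0),
    "General Intelligence": (5.0, 10.0),
    "Instructions following": (3.8, 10.0),
    "Puzzle Solving": (3.0, 7.7),
    "Tool Calling": (10.0, 10.0),
    "Trivia": (3.0, 3.0),
}

def tally_category_wins(scores):
    """Group categories by which model has the higher score (or a tie)."""
    wins = {"Laguna XS 2.1": [], "GLM 5": [], "tie": []}
    for category, (laguna, glm5) in scores.items():
        if laguna > glm5:
            wins["Laguna XS 2.1"].append(category)
        elif glm5 > laguna:
            wins["GLM 5"].append(category)
        else:
            wins["tie"].append(category)
    return wins

wins = tally_category_wins(CATEGORY_SCORES)
for name, categories in wins.items():
    print(f"{name}: {len(categories)} ({', '.join(categories)})")
```

By this count each model leads in three categories and four are tied. GLM 5's leads (General Intelligence, Instructions following, Puzzle Solving) are by much wider margins than Laguna XS 2.1's, which is consistent with its higher overall score.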
