Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Google: Gemini 3.1 Flash Lite vs Z.ai: GLM 5 Turbo

Last updated at: 2026-05-08

Metric Gemini 3.1 Flash Lite Gemini 3.1 Flash Lite low Release: 2026-05-08 GLM 5 Turbo GLM 5 Turbo medium Release: 2026-03-15
Score 7.6 8.1
Rank #44 #20
Reliability 10.0 6.7
Consistency 9.2 8.4
Tests Correct
Attempt pass rate 68.4% 77.2%
Flaky tests 2 4
Total Runs 57 57
Cost per result 0.203 1.438
Total Cost $0.025 $0.187
Input Price $0.250 / 1M $1.200 / 1M
Output Price $1.500 / 1M $4.000 / 1M
Output Tokens 2,702 12,217
Reasoning Tokens 8,596 40,252
Response Time (avg) 1.92s 18.85s
Response Time (max) 5.66s 194.23s
Response Time (total) 36.49s 358.15s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 7.3 6.2 75.0% 2 1.84s 1,013 1,548
GLM 5 Turbo 10.0 10.0 100.0% 0 4.82s 362 3,137
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 10.0 10.0 100.0% 0 1.46s 441 408
GLM 5 Turbo 10.0 10.0 100.0% 0 12.26s 332 3,301
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 3.0 10.0 0.0% 0 4.48s 348 975
GLM 5 Turbo 10.0 10.0 100.0% 0 13.88s 390 2,037
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 10.0 10.0 100.0% 0 1.44s 291 697
GLM 5 Turbo 10.0 10.0 100.0% 0 6.19s 577 3,632
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 5.3 10.0 33.3% 0 1.52s 15 1,214
GLM 5 Turbo 2.9 4.4 22.2% 2 71.07s 9,665 19,279
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 4.0 10.0 0.0% 0 1.37s 69 438
GLM 5 Turbo 6.1 3.1 66.7% 1 10.05s 60 2,216
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 10.0 10.0 100.0% 0 1.52s 72 760
GLM 5 Turbo 10.0 10.0 100.0% 0 5.38s 255 2,183
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 10.0 10.0 100.0% 0 1.40s 210 1,191
GLM 5 Turbo 8.7 7.9 77.8% 1 5.44s 315 2,702
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 10.0 10.0 100.0% 0 5.66s 234 945
GLM 5 Turbo 10.0 10.0 100.0% 0 9.84s 241 446
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 3.0 10.0 0.0% 0 1.46s 9 420
GLM 5 Turbo 3.0 10.0 0.0% 0 40.17s 20 1,319

Quick Compare

Switch Comparison Pair