AI BENCHY

AI BENCHY Compare

Google: Gemini 3.1 Flash Lite vs Z.ai: GLM 5V Turbo

Last updated at: 2026-05-08

| Metric | Gemini 3.1 Flash Lite (low) | GLM 5V Turbo (medium) |
|---|---|---|
| Release | 2026-05-08 | 2026-04-01 |
| Score | 7.6 | 7.5 |
| Rank | #44 | #49 |
| Reliability | 10.0 | 10.0 |
| Consistency | 9.2 | 7.6 |
| Tests Correct | | |
| Attempt pass rate | 68.4% | 73.7% |
| Flaky tests | 2 | 6 |
| Total Runs | 57 | 57 |
| Cost per result | 0.203 | 2.919 |
| Total Cost | $0.025 | $0.322 |
| Input Price | $0.250 / 1M | $1.200 / 1M |
| Output Price | $1.500 / 1M | $4.000 / 1M |
| Output Tokens | 2,702 | 2,373 |
| Reasoning Tokens | 8,596 | 66,463 |
| Response Time (avg) | 1.92s | 16.33s |
| Response Time (max) | 5.66s | 67.08s |
| Response Time (total) | 36.49s | 310.29s |
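
The headline figures above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that "Attempt pass rate" means passing runs divided by total runs (the page does not define it), and the names `MODELS`, `implied_passes`, and `ratio` are made up for this example. All numbers are copied from the comparison table.

```python
# Figures copied from the comparison table; nothing here is measured independently.
MODELS = {
    "Gemini 3.1 Flash Lite": {
        "pass_rate": 0.684, "runs": 57, "total_cost": 0.025, "avg_time_s": 1.92,
    },
    "GLM 5V Turbo": {
        "pass_rate": 0.737, "runs": 57, "total_cost": 0.322, "avg_time_s": 16.33,
    },
}


def implied_passes(pass_rate: float, runs: int) -> int:
    """Passing runs implied by the pass rate.

    Assumption: attempt pass rate = passing runs / total runs.
    """
    return round(pass_rate * runs)


def ratio(a: float, b: float) -> float:
    """How many times larger a is than b."""
    return a / b


gemini = MODELS["Gemini 3.1 Flash Lite"]
glm = MODELS["GLM 5V Turbo"]

gemini_passes = implied_passes(gemini["pass_rate"], gemini["runs"])
glm_passes = implied_passes(glm["pass_rate"], glm["runs"])
cost_ratio = ratio(glm["total_cost"], gemini["total_cost"])
time_ratio = ratio(glm["avg_time_s"], gemini["avg_time_s"])

print(f"Implied passing runs: {gemini_passes} vs {glm_passes} of 57")
print(f"GLM 5V Turbo total cost is {cost_ratio:.1f}x Gemini 3.1 Flash Lite")
print(f"GLM 5V Turbo avg response time is {time_ratio:.1f}x Gemini 3.1 Flash Lite")
```

Under that assumption the pass rates correspond to 39 and 42 passing runs out of 57. GLM 5V Turbo's total cost comes out at roughly 12.9 times Gemini 3.1 Flash Lite's, and its average response time at roughly 8.5 times, for a nearly identical overall score.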

Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens.

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
|---|---|---|---|---|---|---|---|---|---|
| Anti-AI Tricks | Gemini 3.1 Flash Lite | 7.3 | 6.2 | 75.0% | 2 | | 1.84s | 1,013 | 1,548 |
| Anti-AI Tricks | GLM 5V Turbo | 7.2 | 6.1 | 75.0% | 2 | | 10.76s | 587 | 7,872 |
| Coding | Gemini 3.1 Flash Lite | 10.0 | 10.0 | 100.0% | 0 | | 1.46s | 441 | 408 |
| Coding | GLM 5V Turbo | 10.0 | 10.0 | 100.0% | 0 | | 13.78s | 404 | 4,628 |
| Combined | Gemini 3.1 Flash Lite | 3.0 | 10.0 | 0.0% | 0 | | 4.48s | 348 | 975 |
| Combined | GLM 5V Turbo | 6.9 | 3.8 | 66.7% | 1 | | 15.06s | 403 | 2,523 |
| Data parsing and extraction | Gemini 3.1 Flash Lite | 10.0 | 10.0 | 100.0% | 0 | | 1.44s | 291 | 697 |
| Data parsing and extraction | GLM 5V Turbo | 10.0 | 10.0 | 100.0% | 0 | | 9.60s | 236 | 4,333 |
| Domain specific | Gemini 3.1 Flash Lite | 5.3 | 10.0 | 33.3% | 0 | | 1.52s | 15 | 1,214 |
| Domain specific | GLM 5V Turbo | 5.3 | 7.2 | 44.4% | 1 | | 38.15s | 32 | 29,035 |
| General Intelligence | Gemini 3.1 Flash Lite | 4.0 | 10.0 | 0.0% | 0 | | 1.37s | 69 | 438 |
| General Intelligence | GLM 5V Turbo | 10.0 | 10.0 | 100.0% | 0 | | 11.09s | 131 | 2,183 |
| Instructions following | Gemini 3.1 Flash Lite | 10.0 | 10.0 | 100.0% | 0 | | 1.52s | 72 | 760 |
| Instructions following | GLM 5V Turbo | 9.9 | 10.0 | 100.0% | 0 | | 3.74s | 72 | 1,813 |
| Puzzle Solving | Gemini 3.1 Flash Lite | 10.0 | 10.0 | 100.0% | 0 | | 1.40s | 210 | 1,191 |
| Puzzle Solving | GLM 5V Turbo | 7.6 | 7.2 | 77.8% | 1 | | 10.91s | 193 | 5,789 |
| Tool Calling | Gemini 3.1 Flash Lite | 10.0 | 10.0 | 100.0% | 0 | | 5.66s | 234 | 945 |
| Tool Calling | GLM 5V Turbo | 7.0 | 3.7 | 66.7% | 1 | | 12.53s | 293 | 765 |
| Trivia | Gemini 3.1 Flash Lite | 3.0 | 10.0 | 0.0% | 0 | | 1.46s | 9 | 420 |
| Trivia | GLM 5V Turbo | 3.0 | 10.0 | 0.0% | 0 | | 40.96s | 22 | 7,522 |
