Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

Google: Gemini 3.5 Flash vs Z.ai: GLM 5.2

Summary

Gemini 3.5 Flash vs GLM 5.2 benchmark comparison: GLM 5.2 leads on average score with 7.1 vs 6.8. GLM 5.2 has the lower benchmark cost at $0.076 vs $0.108. Gemini 3.5 Flash is faster at 1.57s vs 6.34s, with pass rates of 68.3% vs 60.3%.

Recommended model: Gemini 3.5 Flash - Its score stays close to the best score here (6.8 vs 7.1), while responding about 4.0x faster than GLM 5.2.

Last updated at: 2026-06-17

Metric Gemini 3.5 Flash Gemini 3.5 Flash minimal Release: 2026-05-19 GLM 5.2 GLM 5.2 none Release: 2026-06-17
Score 6.8 7.1
Rank #71 #61
Reliability 10.0 9.9
Consistency 9.6 9.6
Tests Correct
Attempt pass rate 68.3% 60.3%
Flaky tests 1 1
Total Runs 63 63
Cost per result 0.767 0.628
Total Cost $0.108 $0.076
Input Price $1.500 / 1M $1.400 / 1M
Output Price $9.000 / 1M $4.400 / 1M
Total Input Tokens 39,847 38,671
Output Tokens 5,277 4,817
Reasoning Tokens 0 0
Response Time (avg) 1.57s 6.34s
Response Time (max) 5.51s 20.69s
Response Time (total) 33.02s 133.19s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#71 Gemini 3.5 Flash

minimal
Cost
$0.041
Time
20.4s
Tokens
4,608 tok

#61 GLM 5.2

none
Invalid SVG
Cost
$0.033
Time
87.7s
Tokens
7,455 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.5 Flash 6.5 10.0 50.0% 0 892ms 492 405 0
GLM 5.2 8.3 10.0 75.0% 0 3.70s 567 313 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.5 Flash 5.6 9.9 33.3% 0 2.75s 8,122 3,456 0
GLM 5.2 3.7 9.5 0.0% 0 7.55s 7,263 1,958 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.5 Flash 3.0 10.0 0.0% 0 3.56s 15,780 404 0
GLM 5.2 10.0 10.0 100.0% 0 20.69s 14,296 1,489 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.5 Flash 10.0 10.0 100.0% 0 1.66s 7,548 279 0
GLM 5.2 10.0 10.0 100.0% 0 7.17s 7,113 204 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.5 Flash 10.0 10.0 100.0% 0 899ms 633 12 0
GLM 5.2 5.3 10.0 33.3% 0 6.50s 696 27 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.5 Flash 10.0 10.0 100.0% 0 922ms 486 117 0
GLM 5.2 6.1 3.1 66.7% 1 4.42s 480 82 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.5 Flash 6.4 5.8 66.7% 1 893ms 615 76 0
GLM 5.2 9.8 10.0 100.0% 0 3.84s 642 66 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.5 Flash 10.0 10.0 100.0% 0 1.45s 558 282 0
GLM 5.2 7.7 10.0 66.7% 0 3.31s 618 265 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.5 Flash 10.0 10.0 100.0% 0 2.79s 5,457 234 0
GLM 5.2 10.0 10.0 100.0% 0 15.76s 6,807 400 0
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.5 Flash 3.0 10.0 0.0% 0 1.76s 156 12 0
GLM 5.2 3.0 10.0 0.0% 0 3.41s 189 13 0

Quick Compare

Switch Comparison Pair