Navigate
AI BENCHY
Your ad here

AI BENCHY Compare

Google: Gemini 3 Flash Preview vs GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT

Last updated at: 2026-04-02

Metric Gemini 3 Flash Preview Gemini 3 Flash Preview medium Release: 2025-12-17 GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT none Release: Unknown release date
Score 10.0 3.0
Rank #1 #88
Consistency 10.0 10.0
Tests Correct
Attempt pass rate 100.0% 0.0%
Flaky tests 0 0
Total Runs 51 48
Cost per result 0.972 0.000
Total Cost $0.166 $0.000
Input Price $0.500 / 1M $0.000 / 1M
Output Price $3.000 / 1M $0.000 / 1M
Output Tokens 1,640 0
Reasoning Tokens 48,270 0
Response Time (avg) 11.39s 0ms
Response Time (max) 50.16s 0ms
Response Time (total) 113.86s 0ms

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 4.13s 305 3,490
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 50.16s 351 12,645
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 4.72s 279 5,333
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 21.12s 12 14,908
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 4.09s 111 1,285
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 6.10s 72 4,558
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 4.43s 276 4,921
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 10.55s 234 1,130
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0

Quick Compare

Switch Comparison Pair