Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

xAI: Grok 4.1 Fast vs GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT

Last updated at: 2026-04-02

Metric Grok 4.1 Fast Grok 4.1 Fast none Release: 2025-11-19 GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT none Release: Unknown release date
Score 4.4 3.0
Rank #84 #88
Consistency 9.0 10.0
Tests Correct
Attempt pass rate 23.5% 0.0%
Flaky tests 2 0
Total Runs 51 48
Cost per result 0.251 0.000
Total Cost $0.008 $0.000
Input Price $0.200 / 1M $0.000 / 1M
Output Price $0.500 / 1M $0.000 / 1M
Output Tokens 1,154 0
Reasoning Tokens 0 0
Response Time (avg) 1.76s 0ms
Response Time (max) 5.51s 0ms
Response Time (total) 17.56s 0ms

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Grok 4.1 Fast 3.2 10.0 0.0% 0 1.07s 235 0
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Grok 4.1 Fast 3.0 10.0 0.0% 0 3.33s 105 0
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Grok 4.1 Fast 10.0 10.0 100.0% 0 943ms 180 0
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Grok 4.1 Fast 5.9 7.2 55.6% 1 1.06s 15 0
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Grok 4.1 Fast 4.4 9.9 0.0% 0 1.08s 112 0
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Grok 4.1 Fast 3.0 10.0 0.0% 0 923ms 56 0
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Grok 4.1 Fast 3.2 10.0 0.0% 0 1.28s 243 0
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Grok 4.1 Fast 2.8 1.6 33.3% 1 5.51s 208 0
GLM 5v Turbo X Ai/grok 4.20 Google/gemma 4 31b IT 3.0 10.0 0.0% 0 0ms 0 0

Quick Compare

Switch Comparison Pair