
AI BENCHY Compare

Google: Gemma 4 26B A4B vs xAI: Grok 4.20

Last updated at: 2026-05-19

| Metric | Gemma 4 26B A4B (none) | Grok 4.20 (medium) |
| --- | --- | --- |
| Release | 2026-04-03 (Free Available) | 2026-03-31 |
| Score | 6.3 | 6.9 |
| Rank | #89 | #73 |
| Reliability | 10.0 | 10.0 |
| Consistency | 9.1 | 8.3 |
| Tests Correct | | |
| Attempt pass rate | 49.1% | 63.2% |
| Flaky tests | 2 | 4 |
| Total Runs | 57 | 57 |
| Cost per result | 0.063 | 7.559 |
| Total Cost | $0.005 | $0.756 |
| Input Price | $0.060 / 1M | $1.250 / 1M |
| Output Price | $0.330 / 1M | $2.500 / 1M |
| Output Tokens | 1,796 | 1,784 |
| Reasoning Tokens | 0 | 128,233 |
| Response Time (avg) | 6.28s | 14.53s |
| Response Time (max) | 57.10s | 63.48s |
| Response Time (total) | 119.39s | 276.06s |
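The headline figures invite a few quick ratios. The sketch below is one way to compute them, using only values copied from the summary table; the dictionary layout and the helper names `ratio` and `implied_sample_count` are illustrative assumptions, not part of the benchmark site.

```python
# Headline figures copied from the summary table; nothing here is new data.
GEMMA = "Gemma 4 26B A4B"
GROK = "Grok 4.20"

HEADLINE = {
    GEMMA: {
        "score": 6.3,
        "total_cost_usd": 0.005,  # displayed value, already rounded by the page
        "avg_time_s": 6.28,
        "total_time_s": 119.39,
        "total_runs": 57,
    },
    GROK: {
        "score": 6.9,
        "total_cost_usd": 0.756,
        "avg_time_s": 14.53,
        "total_time_s": 276.06,
        "total_runs": 57,
    },
}


def ratio(metric: str, numerator: str = GROK, denominator: str = GEMMA) -> float:
    """Return how many times larger the numerator model's metric is."""
    return HEADLINE[numerator][metric] / HEADLINE[denominator][metric]


def implied_sample_count(model: str) -> float:
    """Total response time divided by average response time.

    This is the number of samples the displayed average would have to
    span if it were a plain mean of the displayed total (an assumption).
    """
    row = HEADLINE[model]
    return row["total_time_s"] / row["avg_time_s"]


if __name__ == "__main__":
    gap = HEADLINE[GROK]["score"] - HEADLINE[GEMMA]["score"]
    print(f"Score gap (Grok - Gemma): {gap:.1f}")
    print(f"Total cost ratio (Grok / Gemma): {ratio('total_cost_usd'):.1f}x")
    print(f"Avg response time ratio (Grok / Gemma): {ratio('avg_time_s'):.2f}x")
    for name in (GEMMA, GROK):
        print(f"{name}: total / avg = {implied_sample_count(name):.2f}")
```

Because the Gemma total cost is shown rounded to $0.005, the cost ratio is only approximate. For both models the total divided by the average comes out close to 19, which would be consistent with 19 tests run three times each (57 runs); that reading is an inference from the numbers, not something the page states.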

Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens.

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | Gemma 4 26B A4B | 8.3 | 10.0 | 75.0% | 0 | | 1.28s | 230 | 0 |
| Anti-AI Tricks | Grok 4.20 | 8.2 | 7.9 | 83.3% | 1 | | 3.95s | 287 | 8,312 |
| Coding | Gemma 4 26B A4B | 4.7 | 1.6 | 66.7% | 1 | | 7.07s | 448 | 0 |
| Coding | Grok 4.20 | 4.3 | 1.1 | 66.7% | 1 | | 24.33s | 250 | 12,804 |
| Combined | Gemma 4 26B A4B | 3.0 | 10.0 | 0.0% | 0 | | 30.53s | 309 | 0 |
| Combined | Grok 4.20 | 10.0 | 10.0 | 100.0% | 0 | | 17.40s | 232 | 9,556 |
| Data parsing and extraction | Gemma 4 26B A4B | 10.0 | 10.0 | 100.0% | 0 | | 1.70s | 285 | 0 |
| Data parsing and extraction | Grok 4.20 | 10.0 | 10.0 | 100.0% | 0 | | 4.17s | 180 | 5,333 |
| Domain specific | Gemma 4 26B A4B | 3.6 | 7.2 | 22.2% | 1 | | 2.49s | 27 | 0 |
| Domain specific | Grok 4.20 | 5.3 | 10.0 | 33.3% | 0 | | 27.03s | 375 | 49,339 |
| General Intelligence | Gemma 4 26B A4B | 4.0 | 10.0 | 0.0% | 0 | | 3.54s | 85 | 0 |
| General Intelligence | Grok 4.20 | 3.9 | 2.6 | 33.3% | 1 | | 24.48s | 65 | 6,440 |
| Instructions following | Gemma 4 26B A4B | 6.3 | 10.0 | 50.0% | 0 | | 1.08s | 75 | 0 |
| Instructions following | Grok 4.20 | 7.3 | 6.0 | 83.3% | 1 | | 4.42s | 40 | 5,474 |
| Puzzle Solving | Gemma 4 26B A4B | 6.2 | 10.0 | 33.3% | 0 | | 739ms | 114 | 0 |
| Puzzle Solving | Grok 4.20 | 7.7 | 10.0 | 66.7% | 0 | | 6.20s | 149 | 7,913 |
| Tool Calling | Gemma 4 26B A4B | 10.0 | 10.0 | 100.0% | 0 | | 57.10s | 210 | 0 |
| Tool Calling | Grok 4.20 | 3.0 | 10.0 | 0.0% | 0 | | 13.68s | 197 | 6,620 |
| Trivia | Gemma 4 26B A4B | 3.0 | 10.0 | 0.0% | 0 | | 778ms | 13 | 0 |
| Trivia | Grok 4.20 | 3.0 | 10.0 | 0.0% | 0 | | 63.48s | 9 | 16,442 |
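To see at a glance which model leads each category, the per-category scores can be tallied. The following is a minimal sketch that uses only the Score values listed in the breakdown; the names `CATEGORY_SCORES`, `leader`, and `tally` are hypothetical helpers introduced for illustration.

```python
from collections import Counter

GEMMA = "Gemma 4 26B A4B"
GROK = "Grok 4.20"

# (Gemma score, Grok score) per category, copied from the category breakdown.
CATEGORY_SCORES = {
    "Anti-AI Tricks": (8.3, 8.2),
    "Coding": (4.7, 4.3),
    "Combined": (3.0, 10.0),
    "Data parsing and extraction": (10.0, 10.0),
    "Domain specific": (3.6, 5.3),
    "General Intelligence": (4.0, 3.9),
    "Instructions following": (6.3, 7.3),
    "Puzzle Solving": (6.2, 7.7),
    "Tool Calling": (10.0, 3.0),
    "Trivia": (3.0, 3.0),
}


def leader(gemma_score: float, grok_score: float) -> str:
    """Name the model with the higher score, or 'tie' when they are equal."""
    if gemma_score > grok_score:
        return GEMMA
    if grok_score > gemma_score:
        return GROK
    return "tie"


def tally(scores: dict) -> dict:
    """Count how many categories each model leads, plus ties."""
    return dict(Counter(leader(*pair) for pair in scores.values()))


if __name__ == "__main__":
    for category, (gemma_score, grok_score) in CATEGORY_SCORES.items():
        delta = grok_score - gemma_score
        print(f"{category}: {leader(gemma_score, grok_score)} (Grok - Gemma = {delta:+.1f})")
    print(tally(CATEGORY_SCORES))
```

By this count each model leads four categories and two are tied, so Grok 4.20's higher overall score seems to come from the size of its leads rather than their number. How the site weights categories into the overall score is not stated on the page.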
