Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Google: Gemma 4 26B A4B vs OpenAI: GPT-5.5

Last updated at: 2026-05-19

Metric Gemma 4 26B A4B Gemma 4 26B A4B medium Release: 2026-04-03 Free Available GPT-5.5 GPT-5.5 low Release: 2026-04-24
Score 7.7 8.9
Rank #43 #10
Reliability 10.0 10.0
Consistency 8.7 10.0
Tests Correct
Attempt pass rate 73.7% 84.2%
Flaky tests 3 0
Total Runs 57 57
Cost per result 0.260 4.412
Total Cost $0.034 $0.706
Input Price $0.060 / 1M $5.000 / 1M
Output Price $0.330 / 1M $30.000 / 1M
Output Tokens 16,725 2,008
Reasoning Tokens 61,536 16,914
Response Time (avg) 33.69s 8.80s
Response Time (max) 180.87s 56.19s
Response Time (total) 606.35s 167.26s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 6.20s 1,142 3,045
GPT-5.5 10.0 10.0 100.0% 0 4.43s 246 1,011
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 2.8 10.0 0.0% 0 147.47s 3,516 4,676
GPT-5.5 10.0 10.0 100.0% 0 7.79s 369 936
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 9.6 10.0 100.0% 0 73.55s 5,415 13,112
GPT-5.5 10.0 10.0 100.0% 0 9.56s 303 717
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 16.51s 1,567 2,827
GPT-5.5 10.0 10.0 100.0% 0 3.28s 228 157
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 2.9 4.4 22.2% 2 23.62s 2,469 7,105
GPT-5.5 5.3 10.0 33.3% 0 27.57s 69 11,731
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 29.76s 25 5,075
GPT-5.5 10.0 10.0 100.0% 0 7.14s 146 170
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 17.54s 887 4,470
GPT-5.5 9.9 10.0 100.0% 0 2.98s 93 356
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 7.7 7.3 77.8% 1 8.52s 457 3,065
GPT-5.5 10.0 10.0 100.0% 0 4.94s 274 895
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 9.01s 450 1,256
GPT-5.5 10.0 10.0 100.0% 0 4.96s 250 101
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemma 4 26B A4B 3.0 10.0 0.0% 0 180.87s 797 16,905
GPT-5.5 3.0 10.0 0.0% 0 10.06s 30 840

Quick Compare

Switch Comparison Pair