Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Google: Gemini 3 Flash Preview vs OpenAI: GPT-5.5

Last updated at: 2026-05-10

Metric Gemini 3 Flash Preview Gemini 3 Flash Preview medium Release: 2025-12-17 GPT-5.5 GPT-5.5 low Release: 2026-04-24
Score 10.0 8.9
Rank #1 #6
Reliability 10.0 10.0
Consistency 10.0 10.0
Tests Correct
Attempt pass rate 100.0% 84.2%
Flaky tests 0 0
Total Runs 57 57
Cost per result 1.722 4.412
Total Cost $0.328 $0.706
Input Price $0.500 / 1M $5.000 / 1M
Output Price $3.000 / 1M $30.000 / 1M
Output Tokens 1,985 2,008
Reasoning Tokens 102,122 16,914
Response Time (avg) 11.43s 8.80s
Response Time (max) 74.66s 56.19s
Response Time (total) 217.10s 167.26s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 3.88s 330 3,216
GPT-5.5 10.0 10.0 100.0% 0 4.43s 246 1,011
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 74.66s 432 48,771
GPT-5.5 10.0 10.0 100.0% 0 7.79s 369 936
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 22.42s 351 10,485
GPT-5.5 10.0 10.0 100.0% 0 9.56s 303 717
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 5.43s 279 4,893
GPT-5.5 10.0 10.0 100.0% 0 3.28s 228 157
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 15.27s 12 21,684
GPT-5.5 5.3 10.0 33.3% 0 27.57s 69 11,731
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 5.19s 72 1,905
GPT-5.5 10.0 10.0 100.0% 0 7.14s 146 170
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 4.04s 72 2,709
GPT-5.5 9.9 10.0 100.0% 0 2.98s 93 356
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 5.48s 192 4,647
GPT-5.5 10.0 10.0 100.0% 0 4.94s 274 895
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 12.60s 234 1,487
GPT-5.5 10.0 10.0 100.0% 0 4.96s 250 101
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 5.50s 11 2,325
GPT-5.5 3.0 10.0 0.0% 0 10.06s 30 840

Quick Compare

Switch Comparison Pair