Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Google: Gemini 3.1 Flash Lite Preview vs Google: Gemini 3.5 Flash

Last updated at: 2026-06-01

Metric Gemini 3.1 Flash Lite Preview Gemini 3.1 Flash Lite Preview high Release: 2026-03-03 Gemini 3.5 Flash Gemini 3.5 Flash low Release: 2026-05-19
Score 8.6 9.3
Rank #14 #3
Reliability N/A 10.0
Consistency 10.0 10.0
Tests Correct
Attempt pass rate 81.3% 90.0%
Flaky tests 0 0
Total Runs 48 60
Cost per result 17.763 1.582
Total Cost $2.310 $0.285
Input Price $0.250 / 1M $1.500 / 1M
Output Price $1.500 / 1M $9.000 / 1M
Output Tokens 1,283 2,027
Reasoning Tokens 1,533,310 23,938
Response Time (avg) 68.14s 2.98s
Response Time (max) 280.52s 6.44s
Response Time (total) 1090.28s 59.59s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 43.87s 144 193,077
Gemini 3.5 Flash 10.0 10.0 100.0% 0 2.52s 209 2,536
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 280.52s 335 380,440
Gemini 3.5 Flash 10.0 10.0 100.0% 0 6.44s 351 3,050
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 7.16s 279 6,186
Gemini 3.5 Flash 10.0 10.0 100.0% 0 1.81s 279 1,164
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 5.3 10.0 33.3% 0 127.58s 18 566,202
Gemini 3.5 Flash 7.7 10.0 66.7% 0 3.39s 12 4,538
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 5.25s 117 3,915
Gemini 3.5 Flash 10.0 10.0 100.0% 0 2.27s 119 916
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 9.8 10.0 100.0% 0 64.03s 69 190,053
Gemini 3.5 Flash 9.9 10.0 100.0% 0 1.86s 71 1,652
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 7.7 10.0 66.7% 0 46.68s 87 190,953
Gemini 3.5 Flash 10.0 10.0 100.0% 0 2.35s 288 2,150
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 7.73s 234 2,484
Gemini 3.5 Flash 10.0 10.0 100.0% 0 3.27s 234 403
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview - - - - - - - -
Gemini 3.5 Flash 6.8 10.0 50.0% 0 5.54s 452 6,839
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview - - - - - - - -
Gemini 3.5 Flash 10.0 10.0 100.0% 0 1.88s 12 690

Quick Compare

Switch Comparison Pair