Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Google: Gemini 3.1 Flash Lite Preview vs OpenAI: GPT-5 Mini

Last updated at: 2026-06-04

Metric Gemini 3.1 Flash Lite Preview Gemini 3.1 Flash Lite Preview none Release: 2026-03-03 GPT-5 Mini GPT-5 Mini medium Release: 2025-08-07
Score 7.2 7.3
Rank #58 #54
Reliability 10.0 10.0
Consistency 9.7 9.1
Tests Correct
Attempt pass rate 60.3% 63.5%
Flaky tests 1 2
Total Runs 63 63
Cost per result 0.148 1.319
Total Cost $0.018 $0.159
Input Price $0.250 / 1M $0.250 / 1M
Output Price $1.500 / 1M $2.000 / 1M
Total Input Tokens 37,582 37,100
Output Tokens 5,547 6,801
Reasoning Tokens 0 67,690
Response Time (avg) 1.21s 23.64s
Response Time (max) 3.39s 88.15s
Response Time (total) 25.45s 496.44s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 7.5 8.4 66.7% 1 1.04s 504 1,092 0
GPT-5 Mini 7.1 7.6 66.7% 1 13.86s 606 1,715 6,378
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 5.5 10.0 33.3% 0 967ms 8,128 670 0
GPT-5 Mini 10.0 10.0 100.0% 0 27.63s 7,302 658 17,152
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 3.0 10.0 0.0% 0 3.20s 13,026 339 0
GPT-5 Mini 10.0 10.0 100.0% 0 88.15s 14,118 754 11,520
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 1.22s 7,550 399 0
GPT-5 Mini 10.0 10.0 100.0% 0 12.58s 7,140 453 3,200
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 5.3 10.0 33.3% 0 942ms 641 568 0
GPT-5 Mini 3.6 7.2 22.2% 1 44.63s 515 293 14,016
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 4.0 10.0 0.0% 0 741ms 488 69 0
GPT-5 Mini 4.5 10.0 0.0% 0 13.50s 477 349 1,856
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 1.13s 623 574 0
GPT-5 Mini 10.0 10.0 100.0% 0 11.59s 660 310 3,968
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 900ms 570 1,045 0
GPT-5 Mini 5.6 9.8 33.3% 0 15.20s 642 1,622 6,144
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 3.39s 5,894 782 0
GPT-5 Mini 10.0 10.0 100.0% 0 18.64s 5,445 487 1,600
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 3.0 10.0 0.0% 0 814ms 158 9 0
GPT-5 Mini 3.0 10.0 0.0% 0 9.99s 195 160 1,856

Quick Compare

Switch Comparison Pair