AI BENCHY
❤️ Made by XCS

Model Name

Google: Gemini 3 Flash Preview

Last updated: Feb 19, 2026

Metric | Google: Gemini 3 Flash Preview
Rank | #10
Company | Google
Score | 6.25
Consistency | 8.60
Cost per result | 0.0754
Total Cost | $0.00528
Tests Correct | 7/12
Attempt pass rate | 66.7%
Flaky tests | 2
Output Tokens | 485
Reasoning Tokens | 0

Category Breakdown

Category | Fully passed tests | Score | Consistency | Attempt pass rate | Flaky tests | Reasoning score | Cost
Anti-AI Tricks | 1/2 | 5.50 | 10.00 | 50.0% | 0 | - | $0.00016
Data parsing and extraction | 1/2 | 5.50 | 5.81 | 83.3% | 1 | - | $0.00357
Domain specific | 2/3 | 7.00 | 10.00 | 66.7% | 0 | - | $0.00038
Instructions following | 1/2 | 5.50 | 5.81 | 66.7% | 1 | - | $0.00054
Puzzle Solving | 2/3 | 7.00 | 10.00 | 66.7% | 0 | - | $0.00066
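
The page does not say how the per-category rows combine into the headline metrics. The sketch below checks one plausible reading: counts (passed tests, flaky tests) are summed, and Score and Attempt pass rate are averaged with each category weighted by its number of tests. The weighting rule, the `Category` class, and the `roll_up` helper are assumptions made for illustration; only the numbers come from the table.

```python
from dataclasses import dataclass


@dataclass
class Category:
    name: str
    passed: int               # fully passed tests
    total: int                # tests in the category
    score: float
    attempt_pass_rate: float  # percent
    flaky: int


# Values copied from the Category Breakdown table.
CATEGORIES = [
    Category("Anti-AI Tricks", 1, 2, 5.50, 50.0, 0),
    Category("Data parsing and extraction", 1, 2, 5.50, 83.3, 1),
    Category("Domain specific", 2, 3, 7.00, 66.7, 0),
    Category("Instructions following", 1, 2, 5.50, 66.7, 1),
    Category("Puzzle Solving", 2, 3, 7.00, 66.7, 0),
]


def roll_up(categories):
    """Combine category rows, weighting averages by test count (assumed rule)."""
    total_tests = sum(c.total for c in categories)
    return {
        "tests_correct": sum(c.passed for c in categories),
        "total_tests": total_tests,
        "score": sum(c.score * c.total for c in categories) / total_tests,
        "attempt_pass_rate": sum(
            c.attempt_pass_rate * c.total for c in categories
        ) / total_tests,
        "flaky": sum(c.flaky for c in categories),
    }


summary = roll_up(CATEGORIES)
print(summary)
```

Under this assumption the rolled-up values agree with the headline Score (6.25), Tests Correct (7/12), Attempt pass rate (66.7% after rounding), and Flaky tests (2). The per-category costs sum to $0.00531, slightly above the listed Total Cost of $0.00528, which may simply reflect rounding in the displayed values.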

Compared models

Compare Google: Gemini 3 Flash Preview against...

#9 · MoonshotAI

MoonshotAI: Kimi K2.5

Reasoning (medium)

Score: 6.42

Consistency: 8.00

Attempt pass rate: 72.2%

Flaky tests: 3

Cost per result: 2.4097

Tests Correct: 7/12

Total Cost: $0.16868

#11 · OpenAI

OpenAI: GPT-5 Nano

Reasoning (medium)

Score: 5.92

Consistency: 6.03

Attempt pass rate: 72.2%

Flaky tests: 6

Cost per result: 0.4675

Tests Correct: 6/12

Total Cost: $0.02806

#8 · xAI

xAI: Grok 4.1 Fast

Reasoning (medium)

Score: 6.42

Consistency: 8.60

Attempt pass rate: 66.7%

Flaky tests: 2

Cost per result: 0.4800

Tests Correct: 7/12

Total Cost: $0.03360
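
"Cost per result" is not defined on the page. The listed figures are consistent with the total cost expressed in US cents divided by the number of fully passed tests; that reading is an inference from the numbers, not a documented formula. A minimal sketch that checks it against the four models shown here (the `cost_per_result` helper and the `MODELS` mapping are hypothetical names):

```python
# (total cost in USD, tests correct, listed cost per result), copied from the page.
MODELS = {
    "Google: Gemini 3 Flash Preview": (0.00528, 7, 0.0754),
    "MoonshotAI: Kimi K2.5": (0.16868, 7, 2.4097),
    "OpenAI: GPT-5 Nano": (0.02806, 6, 0.4675),
    "xAI: Grok 4.1 Fast": (0.03360, 7, 0.4800),
}


def cost_per_result(total_cost_usd: float, tests_correct: int) -> float:
    """Assumed definition: cents spent per fully passed test."""
    if tests_correct == 0:
        raise ValueError("cost per result is undefined with no passed tests")
    return 100 * total_cost_usd / tests_correct


# Largest gap between the assumed formula and the listed values.
max_gap = max(
    abs(cost_per_result(cost, correct) - listed)
    for cost, correct, listed in MODELS.values()
)

for name, (cost, correct, listed) in MODELS.items():
    print(f"{name}: computed {cost_per_result(cost, correct):.4f}, listed {listed:.4f}")
```

Three of the four models match the listed value to four decimal places. GPT-5 Nano differs in the last digits, which would be expected if the site computes the ratio from an unrounded total cost.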
