#24
Google
Release: 2026-03-03
Tested on: 2026-04-11 01:44
google/gemini-3.1-flash-lite-preview::low
Identity note
Gemini 3.1 Flash Lite Preview was the preview version of Google: Gemini 3.1 Flash Lite.
Input Price
$0.250 / 1M
Output Price
$1.500 / 1M
Flaky tests
0
Flaky tests had mixed outcomes across runs (at least one pass and one fail).
Run history
| Tested on | Score | Reliability | Tests Correct | Total Cost | Compare |
|---|---|---|---|---|---|
| 2026-05-22 00:27 Suite changed | 7.6 | 10.0 | $0.025 | Compare | |
| 2026-04-11 01:44 First recorded run | 8.1 | N/A | $0.022 | Current run |
Run comparison
| Run | Score | Consistency | Reliability | Tests Correct | Flaky tests | Total Output Tokens | Total Cost | Response Time (avg) |
|---|---|---|---|---|---|---|---|---|
| 2026-04-11 01:44 · First recorded run | 8.1 | 10.0 | N/A | 13/18 | 0 | 10,305 | $0.022 | 3.22s |
| 2026-05-22 00:27 · Suite changed | 7.6 | 10.0 | 10.0 | 13/20 | 0 | 11,109 | $0.025 | 3.01s |
| Difference | +0.5 | 0.0 | 0 | 0 | -804 | -$0.003 | +208ms |
These two runs used different benchmark suites, so the deltas reflect both model changes and suite changes.
Charts
Choose the first model, then click a second model to open a side-by-side page.
Score vs Total Cost
Response Time (avg)
Score vs Response Time (avg)
Total Output Tokens
Score vs Total Output Tokens
Quick Compare
Gemini 3.1 Flash Lite PreviewlowvsGemini 3.1 Flash LitemediumGemini 3.1 Flash Lite PreviewlowvsQwen3.5-122B-A10BmediumGemini 3.1 Flash Lite PreviewlowvsGemini 3.1 Flash Lite PreviewmediumGemini 3.1 Flash Lite PreviewlowvsGemini 3 Flash PreviewnoneGemini 3.1 Flash Lite PreviewlowvsQwen3.6 PlusmediumGemini 3.1 Flash Lite PreviewlowvsGrok Build 0.1mediumGemini 3.1 Flash Lite PreviewlowvsGemini 3 Flash PreviewmediumGemini 3.1 Flash Lite PreviewlowvsGemini 3.5 FlashhighGemini 3.1 Flash Lite PreviewlowvsRing-2.6-1TmediumGemini 3.1 Flash Lite PreviewlowvsGemini 3.5 Flashlow
Category Breakdown
| Category | Score | Consistency | Tests Correct |
|---|---|---|---|
| Anti-AI Tricks | 8.3 | 10.0 | |
| Coding | 10.0 | 10.0 | |
| Combined | 3.0 | 10.0 | |
| Data parsing and extraction | 10.0 | 10.0 | |
| Domain specific | 5.3 | 10.0 | |
| General Intelligence | 4.0 | 10.0 | |
| Instructions following | 10.0 | 10.0 | |
| Puzzle Solving | 10.0 | 10.0 | |
| Tool Calling | 10.0 | 10.0 |