
AI BENCHY Compare

ByteDance Seed: Seed-2.0-Mini vs Google: Gemini 3.1 Flash Lite Preview

Summary

Seed-2.0-Mini vs Gemini 3.1 Flash Lite Preview benchmark comparison: Gemini 3.1 Flash Lite Preview leads on average score (7.2 vs 6.9), has the lower total benchmark cost ($0.018 vs $0.044), and is faster on average response time (1.21s vs 80.22s). It also has the higher attempt pass rate, 60.3% vs 57.1%.

Recommended model: Gemini 3.1 Flash Lite Preview - It has the best score here (7.2), while costing about 2.5x less than Seed-2.0-Mini.
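
The cost and speed gaps quoted above can be checked with a few lines of arithmetic. This is a minimal sketch that uses only the rounded figures shown on this page, so the ratios are approximate; the variable names are illustrative.

```python
# Rounded figures from the summary (Seed-2.0-Mini vs Gemini 3.1 Flash Lite Preview).
seed_total_cost = 0.044      # USD
gemini_total_cost = 0.018    # USD
seed_avg_time = 80.22        # seconds
gemini_avg_time = 1.21       # seconds

# How many times more expensive / slower Seed-2.0-Mini is in this benchmark.
cost_ratio = seed_total_cost / gemini_total_cost
speed_ratio = seed_avg_time / gemini_avg_time

print(f"cost ratio:  {cost_ratio:.2f}x")
print(f"speed ratio: {speed_ratio:.1f}x")
```

With the rounded inputs this gives a cost ratio of roughly 2.4x and a response-time ratio of roughly 66x, in line with the "about 2.5x" figure in the recommendation.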

Last updated at: 2026-06-04

| Metric | Seed-2.0-Mini (medium) | Gemini 3.1 Flash Lite Preview (none) |
| --- | --- | --- |
| Release | 2026-02-14 | 2026-03-03 |
| Score | 6.9 | 7.2 |
| Rank | #73 | #58 |
| Reliability | 6.7 | 10.0 |
| Consistency | 9.3 | 9.7 |
| Tests Correct | | |
| Attempt pass rate | 57.1% | 60.3% |
| Flaky tests | 2 | 1 |
| Total Runs | 63 | 63 |
| Cost per result | 0.397 | 0.148 |
| Total Cost | $0.044 | $0.018 |
| Input Price | $0.100 / 1M | $0.250 / 1M |
| Output Price | $0.400 / 1M | $1.500 / 1M |
| Total Input Tokens | 41,904 | 37,582 |
| Output Tokens | 2,555 | 5,547 |
| Reasoning Tokens | 95,974 | 0 |
| Response Time (avg) | 80.22s | 1.21s |
| Response Time (max) | 262.83s | 3.39s |
| Response Time (total) | 1363.72s | 25.45s |
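
The Total Cost row can be approximately reconstructed from the token counts and per-million prices in the same table. The sketch below is not the site's documented formula: it assumes that reasoning tokens are billed at the output price, and the function and dictionary names are illustrative.

```python
# Token counts and prices copied from the comparison table.
MODELS = {
    "Seed-2.0-Mini": {
        "input_tokens": 41_904,
        "output_tokens": 2_555,
        "reasoning_tokens": 95_974,
        "input_price_per_m": 0.100,   # USD per 1M input tokens
        "output_price_per_m": 0.400,  # USD per 1M output tokens
    },
    "Gemini 3.1 Flash Lite Preview": {
        "input_tokens": 37_582,
        "output_tokens": 5_547,
        "reasoning_tokens": 0,
        "input_price_per_m": 0.250,
        "output_price_per_m": 1.500,
    },
}


def estimate_total_cost(model: dict) -> float:
    """Estimate total USD cost for one model.

    ASSUMPTION: reasoning tokens are charged at the output-token price.
    """
    input_cost = model["input_tokens"] * model["input_price_per_m"] / 1_000_000
    billed_output = model["output_tokens"] + model["reasoning_tokens"]
    output_cost = billed_output * model["output_price_per_m"] / 1_000_000
    return input_cost + output_cost


for name, model in MODELS.items():
    print(f"{name}: ${estimate_total_cost(model):.3f}")
```

Under that assumption the estimates round to $0.044 and $0.018, matching the Total Cost row. It also shows why the model with the cheaper per-token prices ends up costing more here: most of Seed-2.0-Mini's billed tokens are reasoning tokens.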

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#73 Seed-2.0-Mini (medium): Cost $0.002, Time 161.7s, Tokens 4,379

#58 Gemini 3.1 Flash Lite Preview (none): Cost $0.003, Time 4.7s, Tokens 1,827

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Input Tokens | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | Seed-2.0-Mini | 6.6 | 10.0 | 50.0% | 0 | | 74.75s | 791 | 360 | 9,520 |
| Anti-AI Tricks | Gemini 3.1 Flash Lite Preview | 7.5 | 8.4 | 66.7% | 1 | | 1.04s | 504 | 1,092 | 0 |
| Coding | Seed-2.0-Mini | 5.5 | 9.8 | 33.3% | 0 | | 220.48s | 3,823 | 464 | 34,964 |
| Coding | Gemini 3.1 Flash Lite Preview | 5.5 | 10.0 | 33.3% | 0 | | 967ms | 8,128 | 670 | 0 |
| Combined | Seed-2.0-Mini | 10.0 | 10.0 | 100.0% | 0 | | 262.83s | 16,533 | 404 | 29,806 |
| Combined | Gemini 3.1 Flash Lite Preview | 3.0 | 10.0 | 0.0% | 0 | | 3.20s | 13,026 | 339 | 0 |
| Data parsing and extraction | Seed-2.0-Mini | 10.0 | 10.0 | 100.0% | 0 | | 24.27s | 8,568 | 246 | 2,743 |
| Data parsing and extraction | Gemini 3.1 Flash Lite Preview | 10.0 | 10.0 | 100.0% | 0 | | 1.22s | 7,550 | 399 | 0 |
| Domain specific | Seed-2.0-Mini | 3.0 | 10.0 | 0.0% | 0 | | 0ms | 0 | 0 | 0 |
| Domain specific | Gemini 3.1 Flash Lite Preview | 5.3 | 10.0 | 33.3% | 0 | | 942ms | 641 | 568 | 0 |
| General Intelligence | Seed-2.0-Mini | 5.1 | 3.4 | 33.3% | 1 | | 36.65s | 585 | 213 | 4,210 |
| General Intelligence | Gemini 3.1 Flash Lite Preview | 4.0 | 10.0 | 0.0% | 0 | | 741ms | 488 | 69 | 0 |
| Instructions following | Seed-2.0-Mini | 10.0 | 10.0 | 100.0% | 0 | | 17.47s | 840 | 69 | 2,050 |
| Instructions following | Gemini 3.1 Flash Lite Preview | 10.0 | 10.0 | 100.0% | 0 | | 1.13s | 623 | 574 | 0 |
| Puzzle Solving | Seed-2.0-Mini | 8.2 | 7.2 | 88.9% | 1 | | 31.79s | 903 | 527 | 5,667 |
| Puzzle Solving | Gemini 3.1 Flash Lite Preview | 10.0 | 10.0 | 100.0% | 0 | | 900ms | 570 | 1,045 | 0 |
| Tool Calling | Seed-2.0-Mini | 10.0 | 10.0 | 100.0% | 0 | | 88.68s | 9,585 | 222 | 5,235 |
| Tool Calling | Gemini 3.1 Flash Lite Preview | 10.0 | 10.0 | 100.0% | 0 | | 3.39s | 5,894 | 782 | 0 |
| Trivia | Seed-2.0-Mini | 3.0 | 10.0 | 0.0% | 0 | | 56.76s | 276 | 50 | 1,779 |
| Trivia | Gemini 3.1 Flash Lite Preview | 3.0 | 10.0 | 0.0% | 0 | | 814ms | 158 | 9 | 0 |
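
To see how the category scores split between the two models, the breakdown above can be tallied per category. This is a small sketch using the Score values from this page; the `CATEGORY_SCORES` dictionary and `tally_wins` helper are illustrative names, and the tally treats every category equally, which may not be how the overall score is weighted.

```python
# Category scores from the breakdown: (Seed-2.0-Mini, Gemini 3.1 Flash Lite Preview).
CATEGORY_SCORES = {
    "Anti-AI Tricks": (6.6, 7.5),
    "Coding": (5.5, 5.5),
    "Combined": (10.0, 3.0),
    "Data parsing and extraction": (10.0, 10.0),
    "Domain specific": (3.0, 5.3),
    "General Intelligence": (5.1, 4.0),
    "Instructions following": (10.0, 10.0),
    "Puzzle Solving": (8.2, 10.0),
    "Tool Calling": (10.0, 10.0),
    "Trivia": (3.0, 3.0),
}


def tally_wins(scores: dict) -> dict:
    """Count categories won by each model, plus ties."""
    tally = {"Seed-2.0-Mini": 0, "Gemini 3.1 Flash Lite Preview": 0, "tie": 0}
    for seed_score, gemini_score in scores.values():
        if seed_score > gemini_score:
            tally["Seed-2.0-Mini"] += 1
        elif gemini_score > seed_score:
            tally["Gemini 3.1 Flash Lite Preview"] += 1
        else:
            tally["tie"] += 1
    return tally


tally = tally_wins(CATEGORY_SCORES)
print(tally)
```

On these numbers Gemini 3.1 Flash Lite Preview wins three categories, Seed-2.0-Mini wins two, and five are tied.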
