Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

ByteDance Seed: Seed-2.0-Lite vs Google: Gemini 3.5 Flash

Last updated at: 2026-05-19

Metric Seed-2.0-Lite Seed-2.0-Lite medium Release: 2026-02-14 Gemini 3.5 Flash Gemini 3.5 Flash none Release: 2026-05-19
Score 8.3 9.1
Rank #15 #6
Reliability 10.0 10.0
Consistency 8.9 9.0
Tests Correct
Attempt pass rate 79.0% 91.7%
Flaky tests 3 2
Total Runs 57 57
Cost per result 0.958 3.490
Total Cost $0.125 $0.489
Input Price $0.250 / 1M $1.500 / 1M
Output Price $2.000 / 1M $9.000 / 1M
Output Tokens 3,266 53,202
Reasoning Tokens 54,082 0
Response Time (avg) 31.32s 5.59s
Response Time (max) 168.71s 14.88s
Response Time (total) 595.04s 89.50s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 8.3 10.0 75.0% 0 17.99s 996 7,142
Gemini 3.5 Flash 10.0 10.0 100.0% 0 2.53s 5,101 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 74.49s 436 7,319
Gemini 3.5 Flash 10.0 10.0 100.0% 0 14.88s 11,611 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 37.67s 506 4,299
Gemini 3.5 Flash 0.0 0.0 0.0% 0 0ms 0 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 9.07s 246 1,742
Gemini 3.5 Flash 10.0 10.0 100.0% 0 8.10s 5,895 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 5.9 7.2 55.6% 1 88.74s 15 23,897
Gemini 3.5 Flash 7.6 7.2 77.8% 1 10.64s 17,910 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 6.7 3.6 66.7% 1 18.25s 304 1,620
Gemini 3.5 Flash 10.0 10.0 100.0% 0 3.46s 1,620 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 7.26s 71 1,480
Gemini 3.5 Flash 9.8 10.0 100.0% 0 3.38s 3,928 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 9.0 7.9 88.9% 1 11.03s 461 3,532
Gemini 3.5 Flash 10.0 10.0 100.0% 0 3.13s 4,640 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 12.38s 222 1,011
Gemini 3.5 Flash 0.0 0.0 0.0% 0 0ms 0 0
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 3.0 10.0 0.0% 0 48.32s 9 2,040
Gemini 3.5 Flash 2.8 1.6 33.3% 1 4.87s 2,497 0

Quick Compare

Switch Comparison Pair