Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

ByteDance Seed: Seed-2.0-Lite vs Google: Gemini 2.5 Flash

Last updated at: 2026-04-29

Metric Seed-2.0-Lite Seed-2.0-Lite none Release: 2026-02-14 Gemini 2.5 Flash Gemini 2.5 Flash none Release: 2025-06-17
Score 6.2 6.2
Rank #78 #79
Reliability N/A N/A
Consistency 7.7 9.2
Tests Correct
Attempt pass rate 55.6% 44.4%
Flaky tests 5 2
Total Runs 54 54
Cost per result 0.200 0.184
Total Cost $0.016 $0.013
Input Price $0.250 / 1M $0.300 / 1M
Output Price $2.000 / 1M $2.500 / 1M
Output Tokens 3,129 1,726
Reasoning Tokens 0 0
Response Time (avg) 2.53s 903ms
Response Time (max) 6.70s 4.39s
Response Time (total) 45.46s 16.26s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 3.0 5.9 16.7% 2 2.43s 709 0
Gemini 2.5 Flash 3.0 10.0 0.0% 0 582ms 102 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 4.61s 380 0
Gemini 2.5 Flash 10.0 10.0 100.0% 0 1.16s 453 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 3.0 10.0 0.0% 0 6.59s 498 0
Gemini 2.5 Flash 3.0 10.0 0.0% 0 4.39s 366 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 1.82s 246 0
Gemini 2.5 Flash 10.0 10.0 100.0% 0 652ms 279 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 3.6 7.2 22.2% 1 1.33s 17 0
Gemini 2.5 Flash 5.9 7.2 55.6% 1 495ms 12 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 3.45s 294 0
Gemini 2.5 Flash 5.0 10.0 0.0% 0 615ms 78 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 1.06s 73 0
Gemini 2.5 Flash 8.0 6.8 66.7% 1 672ms 70 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 5.2 4.4 55.6% 2 2.46s 620 0
Gemini 2.5 Flash 5.7 10.0 33.3% 0 576ms 132 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 3.94s 292 0
Gemini 2.5 Flash 10.0 10.0 100.0% 0 1.91s 234 0

Quick Compare

Switch Comparison Pair