Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

ByteDance Seed: Seed-2.0-Mini vs Google: Gemini 3 Flash Preview

Summary

Seed-2.0-Mini vs Gemini 3 Flash Preview benchmark comparison: Seed-2.0-Mini leads on average score with 7.4 vs 6.9. Gemini 3 Flash Preview has the lower benchmark cost at $0.025 vs $0.044. Gemini 3 Flash Preview is faster at 1.65s vs 80.22s, with pass rates of 57.1% vs 66.7%.

Recommended model: Gemini 3 Flash Preview - Its score stays close to the best score here (6.9 vs 7.4), while costing about 1.8x less than Seed-2.0-Mini.

Last updated at: 2026-06-12

Metric Seed-2.0-Mini Seed-2.0-Mini medium Release: 2026-02-14 Gemini 3 Flash Preview Gemini 3 Flash Preview none Release: 2025-12-17
Score 7.4 6.9
Rank #51 #68
Reliability 6.7 10.0
Consistency 9.3 9.2
Tests Correct
Attempt pass rate 57.1% 66.7%
Flaky tests 2 2
Total Runs 63 63
Cost per result 0.397 0.186
Total Cost $0.044 $0.025
Input Price $0.100 / 1M $0.500 / 1M
Output Price $0.400 / 1M $3.000 / 1M
Total Input Tokens 41,904 37,011
Output Tokens 2,555 1,885
Reasoning Tokens 95,974 0
Response Time (avg) 80.22s 1.65s
Response Time (max) 262.83s 3.56s
Response Time (total) 1363.72s 23.07s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#51 Seed-2.0-Mini

medium
Cost
$0.002
Time
161.7s
Tokens
4,379 tok

#68 Gemini 3 Flash Preview

none
Cost
$0.003
Time
5.6s
Tokens
981 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Mini 6.6 10.0 50.0% 0 74.75s 791 360 9,520
Gemini 3 Flash Preview 8.3 10.0 75.0% 0 1.25s 498 214 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Mini 5.5 9.8 33.3% 0 220.48s 3,823 464 34,964
Gemini 3 Flash Preview 5.5 10.0 33.3% 0 1.80s 8,122 453 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Mini 10.0 10.0 100.0% 0 262.83s 16,533 404 29,806
Gemini 3 Flash Preview 4.7 1.6 66.7% 1 3.56s 12,862 350 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Mini 10.0 10.0 100.0% 0 24.27s 8,568 246 2,743
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 1.41s 7,263 279 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Mini 3.0 10.0 0.0% 0 0ms 0 0 0
Gemini 3 Flash Preview 7.7 10.0 66.7% 0 963ms 643 18 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Mini 5.1 3.4 33.3% 1 36.65s 585 213 4,210
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 1.13s 490 104 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Mini 10.0 10.0 100.0% 0 17.47s 840 69 2,050
Gemini 3 Flash Preview 6.4 5.8 66.7% 1 1.58s 619 74 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Mini 8.2 7.2 88.9% 1 31.79s 903 527 5,667
Gemini 3 Flash Preview 7.7 10.0 66.7% 0 1.05s 574 144 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Mini 10.0 10.0 100.0% 0 88.68s 9,585 222 5,235
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 3.35s 5,784 234 0
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Mini 3.0 10.0 0.0% 0 56.76s 276 50 1,779
Gemini 3 Flash Preview 3.0 10.0 0.0% 0 1.07s 156 15 0

Quick Compare

Switch Comparison Pair