Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

ByteDance Seed: Seed-2.0-Lite vs DeepSeek: DeepSeek V4 Flash

Summary

Seed-2.0-Lite vs DeepSeek V4 Flash benchmark comparison: Seed-2.0-Lite leads on average score with 8.2 vs 7.7. DeepSeek V4 Flash has the lower benchmark cost at $0.029 vs $0.175. DeepSeek V4 Flash is faster at 45.85s vs 47.07s, with pass rates of 76.2% vs 74.6%.

Recommended model: DeepSeek V4 Flash - Its score stays close to the best score here (7.7 vs 8.2), while costing about 6.1x less than Seed-2.0-Lite.

Last updated at: 2026-06-10

Metric Seed-2.0-Lite Seed-2.0-Lite medium Release: 2026-02-14 DeepSeek V4 Flash DeepSeek V4 Flash high Release: 2026-04-24
Score 8.2 7.7
Rank #20 #32
Reliability 10.0 10.0
Consistency 9.0 8.5
Tests Correct
Attempt pass rate 76.2% 74.6%
Flaky tests 3 4
Total Runs 63 63
Cost per result 1.250 0.299
Total Cost $0.175 $0.029
Input Price $0.250 / 1M $0.099 / 1M
Output Price $2.000 / 1M $0.197 / 1M
Total Input Tokens 46,740 39,745
Output Tokens 3,230 10,310
Reasoning Tokens 78,406 123,501
Response Time (avg) 47.07s 45.85s
Response Time (max) 254.92s 218.13s
Response Time (total) 988.37s 962.79s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#20 Seed-2.0-Lite

medium
Cost
$0.005
Time
86.7s
Tokens
2,354 tok

#32 DeepSeek V4 Flash

high
Cost
$0.003
Time
93.1s
Tokens
7,926 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 8.3 10.0 75.0% 0 17.99s 942 996 7,142
DeepSeek V4 Flash 8.3 10.0 75.0% 0 28.51s 540 140 7,770
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 8.0 9.8 66.7% 0 156.74s 8,247 458 31,890
DeepSeek V4 Flash 7.8 10.0 66.7% 0 50.60s 7,279 395 34,862
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 37.67s 16,254 506 4,299
DeepSeek V4 Flash 10.0 10.0 100.0% 0 76.57s 14,016 465 7,347
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 9.07s 8,562 246 1,742
DeepSeek V4 Flash 10.0 10.0 100.0% 0 28.03s 7,290 201 1,179
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 5.9 7.2 55.6% 1 88.74s 843 15 23,897
DeepSeek V4 Flash 4.1 4.4 44.5% 2 100.31s 666 27 59,249
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 6.7 3.6 66.7% 1 18.25s 582 304 1,620
DeepSeek V4 Flash 6.1 3.1 66.7% 1 25.15s 471 79 632
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 7.26s 834 71 1,480
DeepSeek V4 Flash 10.0 10.0 100.0% 0 15.36s 627 63 1,622
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 9.0 7.9 88.9% 1 10.23s 894 403 3,285
DeepSeek V4 Flash 8.2 7.2 88.9% 1 26.11s 594 196 1,767
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 10.0 10.0 100.0% 0 12.38s 9,306 222 1,011
DeepSeek V4 Flash 10.0 10.0 100.0% 0 74.73s 8,079 228 542
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Seed-2.0-Lite 3.0 10.0 0.0% 0 48.32s 276 9 2,040
DeepSeek V4 Flash 3.0 10.0 0.0% 0 54.46s 183 8,516 8,531

Quick Compare

Switch Comparison Pair