AI BENCHY Compare

ByteDance Seed: Seed-2.0-Mini vs OpenAI: GPT-5 Mini

Summary

Seed-2.0-Mini vs GPT-5 Mini benchmark comparison: GPT-5 Mini leads on average score (8.5 vs 7.4) and on attempt pass rate (63.5% vs 57.1%). It is also faster, averaging 23.64s per response vs 80.22s for Seed-2.0-Mini. Seed-2.0-Mini has the lower total benchmark cost at $0.044 vs $0.159.

Recommended model: GPT-5 Mini. It has the higher score here (8.5 vs 7.4) and responds about 3.4x faster than Seed-2.0-Mini on average (23.64s vs 80.22s).
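The ratios behind this recommendation can be checked from the figures quoted in the summary. The following is a minimal sketch in Python; the variable names are illustrative and are not part of any AI BENCHY tooling.

```python
# Figures copied from the summary on this page (names are illustrative).
seed = {"score": 7.4, "avg_time_s": 80.22, "total_cost_usd": 0.044}
gpt5_mini = {"score": 8.5, "avg_time_s": 23.64, "total_cost_usd": 0.159}

# How many times faster GPT-5 Mini responds on average.
speedup = seed["avg_time_s"] / gpt5_mini["avg_time_s"]

# How many times more the full benchmark run cost with GPT-5 Mini.
cost_ratio = gpt5_mini["total_cost_usd"] / seed["total_cost_usd"]

# Absolute score difference in favour of GPT-5 Mini.
score_gap = gpt5_mini["score"] - seed["score"]

print(f"GPT-5 Mini is about {speedup:.1f}x faster on average")
print(f"GPT-5 Mini cost about {cost_ratio:.1f}x more for the full run")
print(f"Score gap: {score_gap:.1f} points")
```

The trade-off is roughly symmetric: GPT-5 Mini is about 3.4x faster but about 3.6x more expensive for the same set of runs.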

Last updated at: 2026-06-12

| Metric | Seed-2.0-Mini (medium) | GPT-5 Mini (medium) |
| --- | --- | --- |
| Release | 2026-02-14 | 2025-08-07 |
| Score | 7.4 | 8.5 |
| Rank | #51 | #19 |
| Reliability | 6.7 | 10.0 |
| Consistency | 9.3 | 9.1 |
| Tests Correct | | |
| Attempt pass rate | 57.1% | 63.5% |
| Flaky tests | 2 | 2 |
| Total Runs | 63 | 63 |
| Cost per result | 0.397 | 1.319 |
| Total Cost | $0.044 | $0.159 |
| Input Price | $0.100 / 1M tokens | $0.250 / 1M tokens |
| Output Price | $0.400 / 1M tokens | $2.000 / 1M tokens |
| Total Input Tokens | 41,904 | 37,100 |
| Output Tokens | 2,555 | 6,801 |
| Reasoning Tokens | 95,974 | 67,690 |
| Response Time (avg) | 80.22s | 23.64s |
| Response Time (max) | 262.83s | 88.15s |
| Response Time (total) | 1363.72s | 496.44s |
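The Total Cost figures can be roughly reproduced from the token counts and per-token prices listed above. The page does not say how Total Cost is computed, so this sketch rests on an assumption: reasoning tokens are counted separately from output tokens and billed at the output price. The `Usage` class and `estimate_cost` function are hypothetical helpers written for this illustration.

```python
from dataclasses import dataclass


@dataclass
class Usage:
    input_tokens: int
    output_tokens: int
    reasoning_tokens: int
    input_price_per_m: float    # USD per 1M input tokens
    output_price_per_m: float   # USD per 1M output tokens
    reported_total_cost: float  # USD, as shown on the page


def estimate_cost(u: Usage) -> float:
    """Estimate total cost in USD.

    Assumption (not confirmed by the page): reasoning tokens are billed
    at the output price, in addition to the visible output tokens.
    """
    billed_output = u.output_tokens + u.reasoning_tokens
    total = (u.input_tokens * u.input_price_per_m
             + billed_output * u.output_price_per_m)
    return total / 1_000_000


usage = {
    "Seed-2.0-Mini": Usage(41_904, 2_555, 95_974, 0.100, 0.400, 0.044),
    "GPT-5 Mini": Usage(37_100, 6_801, 67_690, 0.250, 2.000, 0.159),
}

estimates = {name: estimate_cost(u) for name, u in usage.items()}

for name, u in usage.items():
    print(f"{name}: estimated ${estimates[name]:.4f}, "
          f"reported ${u.reported_total_cost:.3f}")
```

Under that assumption the estimates land within a tenth of a cent of the reported totals. Reasoning tokens account for most of the cost for both models, which is why GPT-5 Mini is more expensive overall despite using fewer reasoning tokens: its output price is five times higher.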

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#51 Seed-2.0-Mini (medium): Cost $0.002, Time 161.7s, Tokens 4,379

#19 GPT-5 Mini (medium): Cost $0.007, Time 42.9s, Tokens 3,432

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Input Tokens | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | Seed-2.0-Mini | 6.6 | 10.0 | 50.0% | 0 | | 74.75s | 791 | 360 | 9,520 |
| Anti-AI Tricks | GPT-5 Mini | 7.1 | 7.6 | 66.7% | 1 | | 13.86s | 606 | 1,715 | 6,378 |
| Coding | Seed-2.0-Mini | 5.5 | 9.8 | 33.3% | 0 | | 220.48s | 3,823 | 464 | 34,964 |
| Coding | GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | | 27.63s | 7,302 | 658 | 17,152 |
| Combined | Seed-2.0-Mini | 10.0 | 10.0 | 100.0% | 0 | | 262.83s | 16,533 | 404 | 29,806 |
| Combined | GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | | 88.15s | 14,118 | 754 | 11,520 |
| Data parsing and extraction | Seed-2.0-Mini | 10.0 | 10.0 | 100.0% | 0 | | 24.27s | 8,568 | 246 | 2,743 |
| Data parsing and extraction | GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | | 12.58s | 7,140 | 453 | 3,200 |
| Domain specific | Seed-2.0-Mini | 3.0 | 10.0 | 0.0% | 0 | | 0ms | 0 | 0 | 0 |
| Domain specific | GPT-5 Mini | 3.6 | 7.2 | 22.2% | 1 | | 44.63s | 515 | 293 | 14,016 |
| General Intelligence | Seed-2.0-Mini | 5.1 | 3.4 | 33.3% | 1 | | 36.65s | 585 | 213 | 4,210 |
| General Intelligence | GPT-5 Mini | 4.5 | 10.0 | 0.0% | 0 | | 13.50s | 477 | 349 | 1,856 |
| Instructions following | Seed-2.0-Mini | 10.0 | 10.0 | 100.0% | 0 | | 17.47s | 840 | 69 | 2,050 |
| Instructions following | GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | | 11.59s | 660 | 310 | 3,968 |
| Puzzle Solving | Seed-2.0-Mini | 8.2 | 7.2 | 88.9% | 1 | | 31.79s | 903 | 527 | 5,667 |
| Puzzle Solving | GPT-5 Mini | 5.6 | 9.8 | 33.3% | 0 | | 15.20s | 642 | 1,622 | 6,144 |
| Tool Calling | Seed-2.0-Mini | 10.0 | 10.0 | 100.0% | 0 | | 88.68s | 9,585 | 222 | 5,235 |
| Tool Calling | GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | | 18.64s | 5,445 | 487 | 1,600 |
| Trivia | Seed-2.0-Mini | 3.0 | 10.0 | 0.0% | 0 | | 56.76s | 276 | 50 | 1,779 |
| Trivia | GPT-5 Mini | 3.0 | 10.0 | 0.0% | 0 | | 9.99s | 195 | 160 | 1,856 |
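
The overall scores hide how evenly the categories split. As a rough illustration, the sketch below tallies which model has the higher category score, using only the Score values from the breakdown above. The `tally` function and dictionary layout are hypothetical, written for this example.

```python
# Category scores copied from the breakdown: (Seed-2.0-Mini, GPT-5 Mini).
category_scores = {
    "Anti-AI Tricks": (6.6, 7.1),
    "Coding": (5.5, 10.0),
    "Combined": (10.0, 10.0),
    "Data parsing and extraction": (10.0, 10.0),
    "Domain specific": (3.0, 3.6),
    "General Intelligence": (5.1, 4.5),
    "Instructions following": (10.0, 10.0),
    "Puzzle Solving": (8.2, 5.6),
    "Tool Calling": (10.0, 10.0),
    "Trivia": (3.0, 3.0),
}


def tally(scores):
    """Group categories by which model has the higher score."""
    result = {"Seed-2.0-Mini": [], "GPT-5 Mini": [], "Tie": []}
    for category, (seed_score, gpt_score) in scores.items():
        if seed_score > gpt_score:
            result["Seed-2.0-Mini"].append(category)
        elif gpt_score > seed_score:
            result["GPT-5 Mini"].append(category)
        else:
            result["Tie"].append(category)
    return result


wins = tally(category_scores)
for label, categories in wins.items():
    print(f"{label} ({len(categories)}): {', '.join(categories)}")
```

By this count GPT-5 Mini leads in three categories (Anti-AI Tricks, Coding, Domain specific), Seed-2.0-Mini leads in two (General Intelligence, Puzzle Solving), and five are tied. The largest single gap is Coding, at 10.0 vs 5.5.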
