Navigate
AI BENCHY
Compare Charts
❤️ Made by XCS
Your ad here

AI BENCHY Compare

ByteDance Seed: Seed-2.0-Mini vs Google: Gemini 3.1 Flash Lite Preview

Compare:

Last updated at: 2026-03-05

Metric ByteDance Seed: Seed-2.0-Mini medium Release: 2026-02-14 Google: Gemini 3.1 Flash Lite Preview low Release: 2026-03-03
Avg Score 7.0 7.6
Tests Correct
Rank #23 #12
Consistency 9.4 10.0
Cost per result 0.261 0.170
Total Cost $0.027 $0.019
Attempt pass rate 71.1% 73.3%
Flaky tests 1 0
common.totalAttempts 45 (15 x 3) 45 (15 x 3)
Output Tokens 1,752 1,542
Reasoning Tokens 54,246 6,888
Response Time (avg) 67.46s 3.49s
Response Time (max) 262.83s 11.91s
Response Time (total) 809.49s 52.29s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Avg Score vs Response Time (avg)

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
ByteDance Seed: Seed-2.0-Mini 7.0 10.0 66.7% 0 98.99s 354 9,352
Google: Gemini 3.1 Flash Lite Preview 7.0 10.0 66.7% 0 2.18s 456 1,224
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
ByteDance Seed: Seed-2.0-Mini 10.0 10.0 100.0% 0 262.83s 404 29,806
Google: Gemini 3.1 Flash Lite Preview 10.0 10.0 0.0% 0 11.91s 225 762
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
ByteDance Seed: Seed-2.0-Mini 9.9 10.0 100.0% 0 24.27s 246 2,743
Google: Gemini 3.1 Flash Lite Preview 9.9 10.0 100.0% 0 3.00s 291 696
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
ByteDance Seed: Seed-2.0-Mini 10.0 10.0 0.0% 0 0ms 0 0
Google: Gemini 3.1 Flash Lite Preview 4.0 10.0 33.3% 0 2.36s 18 1,212
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
ByteDance Seed: Seed-2.0-Mini 10.0 10.0 100.0% 0 17.47s 69 2,050
Google: Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 1.49s 72 753
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
ByteDance Seed: Seed-2.0-Mini 7.0 7.2 88.9% 1 25.85s 457 5,060
Google: Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 2.76s 243 1,248
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
ByteDance Seed: Seed-2.0-Mini 10.0 10.0 100.0% 0 88.68s 222 5,235
Google: Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 9.54s 237 993

Quick Compare

Switch Comparison Pair