
AI BENCHY Compare

ByteDance Seed: Seed-2.0-Lite vs Nemotron 3 Super 120b A12b

Last updated at: 2026-03-12

Metric                  Seed-2.0-Lite (none)  Nemotron 3 Super 120b A12b (medium)
Release                 2026-02-14            2026-03-11
Rank                    #45                   #36
Avg Score               4.9                   5.8
Consistency             7.4                   8.5
Cost per result         0.214                 0.000
Total Cost              $0.015                $0.000
Tests Correct
Attempt pass rate       56.3%                 56.3%
Flaky tests             5                     3
Total Runs              48                    48
Output Tokens           2,743                 11,925
Reasoning Tokens        0                     29,687
Response Time (avg)     2.49s                 20.24s
Response Time (max)     6.70s                 87.80s
Response Time (total)   39.91s                303.60s

Nemotron 3 Super 120b A12b: Free Available

Charts (not reproduced in this text): Top Models by Score; Score vs Total Cost; Response Time (avg); Avg Score vs Response Time (avg); Total Output Tokens; Avg Score vs Total Output Tokens.

Category Breakdown

Anti-AI Tricks
Model                       Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Seed-2.0-Lite               10.0   4.6          22.2%              2                           2.93s                703            0
Nemotron 3 Super 120b A12b  10.0   10.0         100.0%             0                           12.96s               1,754          3,264

Combined
Model                       Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Seed-2.0-Lite               10.0   10.0         0.0%               0                           6.59s                498            0
Nemotron 3 Super 120b A12b  10.0   10.0         100.0%             0                           87.80s               2,021          9,996

Data parsing and extraction
Model                       Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Seed-2.0-Lite               9.9    10.0         100.0%             0                           1.82s                246            0
Nemotron 3 Super 120b A12b  9.9    10.0         100.0%             0                           18.16s               877            2,607

Domain specific
Model                       Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Seed-2.0-Lite               10.0   7.2          22.2%              1                           1.33s                17             0
Nemotron 3 Super 120b A12b  10.0   4.4          22.2%              2                           16.19s               5,255          6,072

General Intelligence
Model                       Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Seed-2.0-Lite               10.0   10.0         100.0%             0                           3.45s                294            0
Nemotron 3 Super 120b A12b  2.0    9.9          0.0%               0                           27.86s               104            1,149

Instructions following
Model                       Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Seed-2.0-Lite               10.0   10.0         100.0%             0                           1.06s                73             0
Nemotron 3 Super 120b A12b  7.0    6.5          66.7%              1                           7.72s                1,042          2,479

Puzzle Solving
Model                       Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Seed-2.0-Lite               4.0    4.4          55.6%              2                           2.46s                620            0
Nemotron 3 Super 120b A12b  1.3    9.8          0.0%               0                           8.39s                602            2,151

Tool Calling
Model                       Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Seed-2.0-Lite               10.0   10.0         100.0%             0                           3.94s                292            0
Nemotron 3 Super 120b A12b  10.0   10.0         100.0%             0                           39.75s               270            1,969
