AI BENCHY Compare

Nemotron 3 Super 120b A12b vs Qwen: Qwen3 Coder Next

Last updated at: 2026-03-12

| Metric | Nemotron 3 Super 120b A12b (none) | Qwen3 Coder Next (medium) |
| --- | --- | --- |
| Release | 2026-03-11 (Free Available) | 2026-02-03 |
| Rank | #59 | #58 |
| Avg Score | 3.4 | 3.5 |
| Consistency | 8.6 | 9.1 |
| Cost per result | 0.000 | 0.230 |
| Total Cost | $0.000 | $0.007 |
| Tests Correct | | |
| Attempt pass rate | 31.3% | 27.1% |
| Flaky tests | 3 | 2 |
| Total Runs | 48 | 48 |
| Output Tokens | 4,222 | 2,935 |
| Reasoning Tokens | 0 | 0 |
| Response Time (avg) | 8.90s | 12.53s |
| Response Time (max) | 24.97s | 81.80s |
| Response Time (total) | 142.40s | 125.32s |

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Avg Score vs Response Time (avg); Total Output Tokens; Avg Score vs Total Output Tokens]

Category Breakdown

| Anti-AI Tricks | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Nemotron 3 Super 120b A12b | 10.0 | 10.0 | 0.0% | 0 | | 7.14s | 2,171 | 0 |
| Qwen3 Coder Next | 1.3 | 7.5 | 22.2% | 1 | | 15.28s | 1,246 | 0 |

| Combined | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Nemotron 3 Super 120b A12b | 10.0 | 10.0 | 0.0% | 0 | | 19.98s | 124 | 0 |
| Qwen3 Coder Next | 10.0 | 10.0 | 0.0% | 0 | | 4.28s | 317 | 0 |

| Data parsing and extraction | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Nemotron 3 Super 120b A12b | 9.9 | 10.0 | 100.0% | 0 | | 7.92s | 249 | 0 |
| Qwen3 Coder Next | 5.4 | 10.0 | 50.0% | 0 | | 81.80s | 246 | 0 |

| Domain specific | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Nemotron 3 Super 120b A12b | 10.0 | 7.2 | 22.2% | 1 | | 6.23s | 26 | 0 |
| Qwen3 Coder Next | 4.0 | 10.0 | 33.3% | 0 | | 638ms | 25 | 0 |

| General Intelligence | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Nemotron 3 Super 120b A12b | 3.0 | 9.9 | 0.0% | 0 | | 24.97s | 170 | 0 |
| Qwen3 Coder Next | 6.0 | 3.4 | 66.7% | 1 | | 1.39s | 142 | 0 |

| Instructions following | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Nemotron 3 Super 120b A12b | 4.5 | 6.9 | 33.3% | 1 | | 1.50s | 66 | 0 |
| Qwen3 Coder Next | 4.5 | 10.0 | 0.0% | 0 | | 7.34s | 63 | 0 |

| Puzzle Solving | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Nemotron 3 Super 120b A12b | 4.7 | 10.0 | 33.3% | 0 | | 7.50s | 1,135 | 0 |
| Qwen3 Coder Next | 10.0 | 10.0 | 0.0% | 0 | | 2.30s | 641 | 0 |

| Tool Calling | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Nemotron 3 Super 120b A12b | 10.0 | 1.6 | 66.7% | 1 | | 16.00s | 281 | 0 |
| Qwen3 Coder Next | 10.0 | 10.0 | 100.0% | 0 | | 2.64s | 255 | 0 |
