Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

NVIDIA: Nemotron 3 Super vs Qwen: Qwen3.5-35B-A3B

Summary

Nemotron 3 Super vs Qwen3.5-35B-A3B benchmark comparison: Qwen3.5-35B-A3B leads on average score with 5.6 vs 4.9. Nemotron 3 Super has the lower benchmark cost at $0.007 vs $0.012. Qwen3.5-35B-A3B is faster at 3.37s vs 5.30s, with pass rates of 31.8% vs 42.9%.

Recommended model: Qwen3.5-35B-A3B - It has the best score here (5.6), while responding about 1.6x faster than Nemotron 3 Super.

Last updated at: 2026-06-10

Metric Nemotron 3 Super Nemotron 3 Super none Release: 2026-03-11 Free Available Qwen3.5-35B-A3B Qwen3.5-35B-A3B none Release: 2026-02-24
Score 4.9 5.6
Rank #142 #118
Reliability 10.0 10.0
Consistency 8.8 8.9
Tests Correct
Attempt pass rate 31.8% 42.9%
Flaky tests 3 3
Total Runs 63 63
Cost per result 0.034 0.230
Total Cost $0.007 $0.012
Input Price $0.090 / 1M $0.140 / 1M
Output Price $0.450 / 1M $1.000 / 1M
Total Input Tokens 36,456 48,194
Output Tokens 6,195 4,343
Reasoning Tokens 0 0
Response Time (avg) 5.30s 3.37s
Response Time (max) 16.45s 47.43s
Response Time (total) 111.31s 70.75s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#142 Nemotron 3 Super

none
No showcase result has been generated for this model yet.
Cost
$0.000
Time
-
Tokens
0 tok

#118 Qwen3.5-35B-A3B

none
Cost
$0.005
Time
28.4s
Tokens
4,518 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Nemotron 3 Super 4.8 10.0 25.0% 0 4.46s 671 2,322 0
Qwen3.5-35B-A3B 3.4 7.9 16.7% 1 1.43s 696 574 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Nemotron 3 Super 3.3 7.2 11.1% 1 2.64s 7,627 571 0
Qwen3.5-35B-A3B 5.5 10.0 33.3% 0 1.39s 7,808 571 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Nemotron 3 Super 3.0 10.0 0.0% 0 16.45s 8,740 617 0
Qwen3.5-35B-A3B 3.0 10.0 0.0% 0 47.43s 20,739 1,833 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Nemotron 3 Super 10.0 10.0 100.0% 0 7.92s 7,944 249 0
Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 1.16s 7,794 243 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Nemotron 3 Super 3.6 7.2 22.2% 1 6.23s 789 26 0
Qwen3.5-35B-A3B 7.7 10.0 66.7% 0 485ms 789 15 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Nemotron 3 Super 4.6 10.0 0.0% 0 950ms 500 134 0
Qwen3.5-35B-A3B 6.5 3.4 66.7% 1 1.19s 522 114 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Nemotron 3 Super 6.3 10.0 50.0% 0 804ms 723 66 0
Qwen3.5-35B-A3B 6.3 10.0 50.0% 0 809ms 711 63 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Nemotron 3 Super 5.5 10.0 33.3% 0 2.36s 714 1,125 0
Qwen3.5-35B-A3B 3.7 7.4 22.2% 1 1.35s 714 655 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Nemotron 3 Super 4.7 1.6 66.7% 1 16.00s 8,541 281 0
Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 2.30s 8,211 264 0
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Nemotron 3 Super 3.0 10.0 0.0% 0 8.94s 207 804 0
Qwen3.5-35B-A3B 3.0 10.0 0.0% 0 493ms 210 11 0

Quick Compare

Switch Comparison Pair