Nemotron 3 Ultra (medium) vs Qwen3.6 Flash

Recommended model: Nemotron 3 Ultra (medium)

It has the higher score of the two models in this comparison (7.5 vs. 6.1) and is rated as having the better overall balance of cost and response time.

Detailed comparison

Metric	Nemotron 3 Ultra (medium, Release: 2026-06-04, Free Available)	Qwen3.6 Flash (none, Release: 2026-04-20)
Score	7.5	6.1
Rank	#58	#135
Reliability	9.8	10.0
Consistency	8.5	9.6
Tests Correct
Attempt pass rate	68.2%	34.9%
Flaky tests	4	1
Total Runs	66	66
Cost per result	0.000	0.935
Total Cost	$0.774	$0.062
Input Price	$0.600 / 1M	$0.188 / 1M
Output Price	$3.600 / 1M	$1.125 / 1M
Total Input Tokens	233,488	139,788
Output Tokens	57,916	30,947
Reasoning Tokens	128,062	0
Response Time (avg)	32.21s	3.74s
Response Time (max)	392.56s	48.79s
Response Time (total)	708.65s	82.38s
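
The summary figures can be cross-checked with a little arithmetic. The sketch below copies the numbers from the table and makes two assumptions that the page does not state: that the attempt pass rate is passes divided by total runs, and that reasoning tokens are billed at the output price. The function names `implied_passes` and `estimated_cost` are hypothetical helpers introduced only for this illustration.

```python
# Figures copied from the summary table above. The formulas are assumptions,
# not the benchmark's documented method.
RUNS = 66  # "Total Runs" for both models

MODELS = {
    "Nemotron 3 Ultra": {
        "pass_rate": 0.682,
        "input_tokens": 233_488,
        "output_tokens": 57_916,
        "reasoning_tokens": 128_062,
        "input_price": 0.600,   # USD per 1M tokens
        "output_price": 3.600,  # USD per 1M tokens
        "reported_cost": 0.774,
    },
    "Qwen3.6 Flash": {
        "pass_rate": 0.349,
        "input_tokens": 139_788,
        "output_tokens": 30_947,
        "reasoning_tokens": 0,
        "input_price": 0.188,
        "output_price": 1.125,
        "reported_cost": 0.062,
    },
}


def implied_passes(pass_rate: float, runs: int = RUNS) -> int:
    """Number of passing attempts implied by a rounded pass rate."""
    return round(pass_rate * runs)


def estimated_cost(m: dict) -> float:
    """List-price estimate, assuming reasoning tokens bill as output tokens."""
    billed_output = m["output_tokens"] + m["reasoning_tokens"]
    total = m["input_tokens"] * m["input_price"] + billed_output * m["output_price"]
    return total / 1_000_000


for name, m in MODELS.items():
    print(
        f"{name}: ~{implied_passes(m['pass_rate'])}/{RUNS} passing attempts, "
        f"estimated ${estimated_cost(m):.3f} vs reported ${m['reported_cost']:.3f}"
    )
```

Under these assumptions the pass rates correspond to 45 and 23 passing attempts out of 66. The list-price estimates come to roughly $0.810 and $0.061, close to but not identical with the reported $0.774 and $0.062, so the benchmark's actual billing presumably differs in some detail that the page does not show.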

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

medium

Invalid SVG

none

Category:

Anti-AI Tricks	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Nemotron 3 Ultra	10.0	10.0	100.0%	0		8.62s	780	835	1,485
Qwen3.6 Flash	3.1	10.0	0.0%	0		1.63s	696	1,554	0

Coding	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Nemotron 3 Ultra	8.4	7.4	88.9%	1		26.53s	7,686	2,854	17,725
Qwen3.6 Flash	5.4	10.0	33.3%	0		1.79s	6,488	889	0

Combined	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Nemotron 3 Ultra	6.3	5.8	66.7%	1		218.25s	204,249	40,954	78,561
Qwen3.6 Flash	3.8	5.8	33.3%	1		26.50s	113,653	27,098	0

Data parsing and extraction	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Nemotron 3 Ultra	10.0	10.0	100.0%	0		5.68s	7,989	473	1,285
Qwen3.6 Flash	10.0	10.0	100.0%	0		2.13s	7,794	243	0

Domain specific	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Nemotron 3 Ultra	3.5	4.4	33.3%	2		24.90s	858	11,169	16,249
Qwen3.6 Flash	5.3	10.0	33.3%	0		1.11s	789	15	0

General Intelligence	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Nemotron 3 Ultra	3.7	9.5	0.0%	0		2.52s	360	70	235
Qwen3.6 Flash	10.0	10.0	100.0%	0		947ms	522	132	0

Instructions following	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Nemotron 3 Ultra	9.8	10.0	100.0%	0		6.35s	765	182	1,243
Qwen3.6 Flash	6.3	10.0	50.0%	0		1.10s	711	66	0

Puzzle Solving	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Nemotron 3 Ultra	5.5	9.9	33.3%	0		3.54s	792	771	2,055
Qwen3.6 Flash	3.5	10.0	0.0%	0		1.21s	714	669	0

Tool Calling	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Nemotron 3 Ultra	10.0	10.0	100.0%	0		7.72s	9,781	304	984
Qwen3.6 Flash	10.0	10.0	100.0%	0		2.49s	8,211	272	0

Trivia	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Nemotron 3 Ultra	3.0	10.0	0.0%	0		38.47s	228	304	8,240
Qwen3.6 Flash	3.0	10.0	0.0%	0		649ms	210	9	0
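
A quick way to read the category tables together is to tally which model scores higher in each one. The sketch below copies the Score column from the ten tables above; the helper names `tally` and `unweighted_mean` are hypothetical, and the plain average is only an illustration, since the page does not say how the overall score is weighted.

```python
# Category scores copied from the tables above: (Nemotron 3 Ultra, Qwen3.6 Flash).
SCORES = {
    "Anti-AI Tricks": (10.0, 3.1),
    "Coding": (8.4, 5.4),
    "Combined": (6.3, 3.8),
    "Data parsing and extraction": (10.0, 10.0),
    "Domain specific": (3.5, 5.3),
    "General Intelligence": (3.7, 10.0),
    "Instructions following": (9.8, 6.3),
    "Puzzle Solving": (5.5, 3.5),
    "Tool Calling": (10.0, 10.0),
    "Trivia": (3.0, 3.0),
}


def tally(scores: dict) -> dict:
    """Count category wins per model, plus ties."""
    wins = {"Nemotron 3 Ultra": 0, "Qwen3.6 Flash": 0, "tie": 0}
    for nemotron, qwen in scores.values():
        if nemotron > qwen:
            wins["Nemotron 3 Ultra"] += 1
        elif qwen > nemotron:
            wins["Qwen3.6 Flash"] += 1
        else:
            wins["tie"] += 1
    return wins


def unweighted_mean(scores: dict, index: int) -> float:
    """Plain average of one model's category scores (an assumption, not the site's formula)."""
    values = [pair[index] for pair in scores.values()]
    return sum(values) / len(values)


print(tally(SCORES))
print(round(unweighted_mean(SCORES, 0), 2), round(unweighted_mean(SCORES, 1), 2))
```

By this count Nemotron 3 Ultra leads in five categories, Qwen3.6 Flash in two, and three are tied. The unweighted means (about 7.02 and 6.04) sit near the reported overall scores of 7.5 and 6.1 without matching them, which suggests the overall score is not a simple average of category scores.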
