Claude Sonnet 4.6 vs Nemotron 3 Ultra (medium)

Recommended model Claude Sonnet 4.6

Its score stays close to the best score here (7.3 vs 7.5), while responding about 4.0x faster than Nemotron 3 Ultra (medium).

Detailed comparison

Metric	Claude Sonnet 4.6 Claude Sonnet 4.6 none Release: 2026-02-17	Nemotron 3 Ultra Nemotron 3 Ultra medium Release: 2026-06-04 Free Available

Metric	Claude Sonnet 4.6 Claude Sonnet 4.6 none Release: 2026-02-17	Nemotron 3 Ultra Nemotron 3 Ultra medium Release: 2026-06-04 Free Available
Score	7.3	7.5
Rank	#71	#58
Reliability	10.0	9.8
Consistency	9.7	8.5
Tests Correct
Attempt pass rate	57.6%	68.2%
Flaky tests	1	4
Total Runs	66	66
Cost per result	5.502	0.000
Total Cost	$0.661	$0.774
Input Price	$3.000 / 1M	$0.600 / 1M
Output Price	$15.000 / 1M	$3.600 / 1M
Total Input Tokens	123,264	233,488
Output Tokens	19,362	57,916
Reasoning Tokens	0	128,062
Response Time (avg)	8.12s	32.21s
Response Time (max)	51.18s	392.56s
Response Time (total)	121.78s	708.65s

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

none

medium

Invalid SVG

Category:

Anti-AI Tricks	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Claude Sonnet 4.6	4.8	10.0	25.0%	0		2.94s	636	1,214	0
Nemotron 3 Ultra	10.0	10.0	100.0%	0		8.62s	780	835	1,485

Coding	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Claude Sonnet 4.6	5.5	10.0	33.3%	0		5.19s	8,522	2,127	0
Nemotron 3 Ultra	8.4	7.4	88.9%	1		26.53s	7,686	2,854	17,725

Combined	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Claude Sonnet 4.6	9.8	10.0	100.0%	0		37.51s	91,402	13,663	0
Nemotron 3 Ultra	6.3	5.8	66.7%	1		218.25s	204,249	40,954	78,561

Data parsing and extraction	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Claude Sonnet 4.6	10.0	10.0	100.0%	0		3.43s	8,574	252	0
Nemotron 3 Ultra	10.0	10.0	100.0%	0		5.68s	7,989	473	1,285

Domain specific	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Claude Sonnet 4.6	7.7	10.0	66.7%	0		3.54s	759	413	0
Nemotron 3 Ultra	3.5	4.4	33.3%	2		24.90s	858	11,169	16,249

General Intelligence	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Claude Sonnet 4.6	6.1	3.1	66.7%	1		2.56s	513	192	0
Nemotron 3 Ultra	3.7	9.5	0.0%	0		2.52s	360	70	235

Instructions following	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Claude Sonnet 4.6	6.5	10.0	50.0%	0		1.96s	690	90	0
Nemotron 3 Ultra	9.8	10.0	100.0%	0		6.35s	765	182	1,243

Puzzle Solving	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Claude Sonnet 4.6	7.7	10.0	66.7%	0		2.53s	663	533	0
Nemotron 3 Ultra	5.5	9.9	33.3%	0		3.54s	792	771	2,055

Tool Calling	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Claude Sonnet 4.6	10.0	10.0	100.0%	0		4.11s	11,301	447	0
Nemotron 3 Ultra	10.0	10.0	100.0%	0		7.72s	9,781	304	984

Trivia	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Claude Sonnet 4.6	3.0	10.0	0.0%	0		4.67s	204	431	0
Nemotron 3 Ultra	3.0	10.0	0.0%	0		38.47s	228	304	8,240

Switch Comparison Pair