inclusionAI: Ling-2.6-flash vs NVIDIA: Nemotron 3 Ultra

Nemotron 3 Ultra (medium) leads on average score with 7.5 vs 4.9. Ling-2.6-flash has the lower benchmark cost at $0.002 vs $0.774. Ling-2.6-flash is faster at 10.68s vs 32.21s, with pass rates of 30.3% vs 68.2%.

Recommended modelNemotron 3 Ultra (medium)It has the strongest score in this comparison (7.5) and the best overall balance of cost and response time across all 2 models.

Last updated at: 2026-07-21

Metric	Ling-2.6-flash Ling-2.6-flash none Release: 2026-04-21	Nemotron 3 Ultra Nemotron 3 Ultra medium Release: 2026-06-04 Free Available

Metric	Ling-2.6-flash Ling-2.6-flash none Release: 2026-04-21	Nemotron 3 Ultra Nemotron 3 Ultra medium Release: 2026-06-04 Free Available
Score	4.9	7.5
Rank	#184	#55
Reliability	10.0	9.8
Consistency	9.3	8.5
Tests Correct
Attempt pass rate	30.3%	68.2%
Flaky tests	2	4
Total Runs	66	66
Cost per result	0.024	0.000
Total Cost	$0.002	$0.774
Input Price	$0.010 / 1M	$0.600 / 1M
Output Price	$0.030 / 1M	$3.600 / 1M
Total Input Tokens	114,375	233,488
Output Tokens	14,903	57,916
Reasoning Tokens	0	128,062
Response Time (avg)	10.68s	32.21s
Response Time (max)	36.03s	392.56s
Response Time (total)	213.51s	708.65s

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#184 Ling-2.6-flash

none

Ling-2.6-flash is no longer available as a free model. It has transitioned to a paid model. Continue using it here: https://openrouter.ai/inclusionai/ling-2.6-flash

Cost: $0.000
Time: 0.0s
Tokens: 0 tok

#55 Nemotron 3 Ultra

medium

Invalid SVG

Cost: $0.000
Time: 300.0s
Tokens: 0 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Category:

Anti-AI Tricks	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Ling-2.6-flash	6.8	8.1	58.3%	1		11.81s	726	573	0
Nemotron 3 Ultra	10.0	10.0	100.0%	0		8.62s	780	835	1,485

Coding	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Ling-2.6-flash	5.3	10.0	33.3%	0		11.21s	813	381	0
Nemotron 3 Ultra	8.4	7.4	88.9%	1		26.53s	7,686	2,854	17,725

Combined	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Ling-2.6-flash	3.0	10.0	0.0%	0		35.69s	94,475	13,094	0
Nemotron 3 Ultra	6.3	5.8	66.7%	1		218.25s	204,249	40,954	78,561

Data parsing and extraction	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Ling-2.6-flash	6.5	10.0	50.0%	0		8.48s	8,004	246	0
Nemotron 3 Ultra	10.0	10.0	100.0%	0		5.68s	7,989	473	1,285

Domain specific	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Ling-2.6-flash	3.0	10.0	0.0%	0		4.95s	810	24	0
Nemotron 3 Ultra	3.5	4.4	33.3%	2		24.90s	858	11,169	16,249

General Intelligence	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Ling-2.6-flash	4.0	10.0	0.0%	0		1.45s	540	109	0
Nemotron 3 Ultra	3.7	9.5	0.0%	0		2.52s	360	70	235

Instructions following	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Ling-2.6-flash	9.8	10.0	100.0%	0		5.52s	732	81	0
Nemotron 3 Ultra	9.8	10.0	100.0%	0		6.35s	765	182	1,243

Puzzle Solving	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Ling-2.6-flash	2.9	7.2	11.1%	1		6.51s	729	151	0
Nemotron 3 Ultra	5.5	9.9	33.3%	0		3.54s	792	771	2,055

Tool Calling	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Ling-2.6-flash	3.0	10.0	0.0%	0		18.80s	7,324	229	0
Nemotron 3 Ultra	10.0	10.0	100.0%	0		7.72s	9,781	304	984

Trivia	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Ling-2.6-flash	3.0	10.0	0.0%	0		1.06s	222	15	0
Nemotron 3 Ultra	3.0	10.0	0.0%	0		38.47s	228	304	8,240

Quick Compare

Switch Comparison Pair