AI BENCHY Compare

NVIDIA: Nemotron 3 Super vs Elephant Alpha

Last updated at: 2026-04-26

Metric	Nemotron 3 Super Nemotron 3 Super none Release: 2026-03-11 Free Available	Elephant Alpha Elephant Alpha none Release: 2026-04-14

Metric	Nemotron 3 Super Nemotron 3 Super none Release: 2026-03-11 Free Available	Elephant Alpha Elephant Alpha none Release: 2026-04-14
Score	5.1	5.2
Rank	#103	#99
Reliability	N/A	N/A
Consistency	8.2	9.6
Tests Correct
Attempt pass rate	35.2%	31.5%
Flaky tests	4	1
Total Runs	52	54
Cost per result	0.000	0.000
Total Cost	$0.000	$0.000
Input Price	$0.090 / 1M	$0.000 / 1M
Output Price	$0.450 / 1M	$0.000 / 1M
Output Tokens	4,760	2,573
Reasoning Tokens	0	0
Response Time (avg)	8.54s	1.23s
Response Time (max)	24.97s	3.81s
Response Time (total)	153.69s	22.16s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Output Tokens	Reasoning Tokens
Nemotron 3 Super	4.8	10.0	25.0%	0		7.43s	2,174	0
Elephant Alpha	6.6	10.0	50.0%	0		963ms	610	0

Coding	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Output Tokens	Reasoning Tokens
Nemotron 3 Super	3.3	1.6	33.3%	1		2.99s	535	0
Elephant Alpha	6.4	3.3	66.7%	1		1.39s	375	0

Combined	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Output Tokens	Reasoning Tokens
Nemotron 3 Super	3.0	10.0	0.0%	0		19.98s	124	0
Elephant Alpha	3.0	10.0	0.0%	0		3.81s	731	0

Data parsing and extraction	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Output Tokens	Reasoning Tokens
Nemotron 3 Super	10.0	10.0	100.0%	0		7.92s	249	0
Elephant Alpha	6.5	10.0	50.0%	0		1.04s	246	0

Domain specific	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Output Tokens	Reasoning Tokens
Nemotron 3 Super	3.6	7.2	22.2%	1		6.23s	26	0
Elephant Alpha	3.0	10.0	0.0%	0		927ms	24	0

General Intelligence	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Output Tokens	Reasoning Tokens
Nemotron 3 Super	4.2	9.9	0.0%	0		24.97s	170	0
Elephant Alpha	4.0	10.0	0.0%	0		854ms	106	0

Instructions following	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Output Tokens	Reasoning Tokens
Nemotron 3 Super	4.9	6.9	33.3%	1		1.50s	66	0
Elephant Alpha	9.8	10.0	100.0%	0		1.03s	81	0

Puzzle Solving	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Output Tokens	Reasoning Tokens
Nemotron 3 Super	5.7	10.0	33.3%	0		7.50s	1,135	0
Elephant Alpha	3.3	10.0	0.0%	0		849ms	170	0

Tool Calling	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Output Tokens	Reasoning Tokens
Nemotron 3 Super	4.7	1.6	66.7%	1		16.00s	281	0
Elephant Alpha	3.0	10.0	0.0%	0		2.79s	230	0

Quick Compare

Switch Comparison Pair

MiniMax M2.7mediumvsElephant Alphanone Nemotron 3 SupernoneFree AvailablevsElephant Alphamedium MiniMax M2.7mediumvsNemotron 3 SupernoneFree Available Nemotron 3 SupernoneFree AvailablevsQwen3 Coder Nextmedium Mistral Small 4mediumvsElephant Alphanone Elephant AlphanonevsQwen3 Coder Nextmedium Nemotron 3 SupernoneFree AvailablevsGLM 4.7 Flashmedium MiniMax M2.5mediumFree AvailablevsElephant Alphanone Mistral Small 4mediumvsNemotron 3 SupernoneFree Available Elephant AlphanonevsGLM 4.7 Flashmedium MiniMax M2.5mediumFree AvailablevsNemotron 3 SupernoneFree Available gpt-oss-120bmediumFree AvailablevsElephant Alphanone