DeepSeek V4 Flash (high) vs GPT-5.4 Nano (medium)

Recommended model: DeepSeek V4 Flash (high)

It has the higher overall score (7.7 vs 7.5), and its total cost for the run is about 2.3x lower than that of GPT-5.4 Nano (medium) ($0.060 vs $0.138).
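The "about 2.3x" figure can be checked against the two cost rows of the comparison table. The sketch below is only an illustration: the dictionary names and the `ratio` helper are hypothetical, and the numbers are copied from the "Total Cost" and "Cost per result" rows. It suggests that the 2.3x claim matches the total-cost ratio, while the cost-per-result ratio is somewhat larger.

```python
# Figures copied from the comparison table ("Total Cost" and "Cost per result" rows).
total_cost = {"DeepSeek V4 Flash (high)": 0.060, "GPT-5.4 Nano (medium)": 0.138}
cost_per_result = {"DeepSeek V4 Flash (high)": 0.402, "GPT-5.4 Nano (medium)": 1.150}


def ratio(values):
    """Return how many times larger the biggest value is than the smallest."""
    return max(values.values()) / min(values.values())


total_cost_ratio = ratio(total_cost)
cost_per_result_ratio = ratio(cost_per_result)

print(f"total cost ratio:      {total_cost_ratio:.2f}x")       # 2.30x
print(f"cost per result ratio: {cost_per_result_ratio:.2f}x")  # 2.86x
```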

Detailed comparison

Metric	DeepSeek V4 Flash (high), released 2026-04-24	GPT-5.4 Nano (medium), released 2026-03-17
Score	7.7	7.5
Rank	#53	#62
Reliability	10.0	10.0
Consistency	8.2	8.5
Tests Correct
Attempt pass rate	72.7%	65.2%
Flaky tests	5	4
Total Runs	66	66
Cost per result	0.402	1.150
Total Cost	$0.060	$0.138
Input Price	$0.140 / 1M	$0.200 / 1M
Output Price	$0.280 / 1M	$1.250 / 1M
Total Input Tokens	108,392	82,819
Output Tokens	14,478	7,100
Reasoning Tokens	153,687	90,022
Response Time (avg)	49.75s	13.24s
Response Time (max)	218.13s	94.06s
Response Time (total)	1094.41s	291.33s
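
As a rough sanity check, the total cost can be approximated from the token counts and per-token prices in the table. The sketch below rests on an assumption the page does not state: reasoning tokens are billed at the output price, with no caching or other discounts. The `Usage` class and both helper functions are hypothetical names introduced for illustration. Under that assumption the estimate comes to roughly $0.062 for DeepSeek V4 Flash (reported: $0.060) and $0.138 for GPT-5.4 Nano (reported: $0.138), so the benchmark's own accounting may differ slightly. The same sketch converts the attempt pass rate into an implied number of passing runs.

```python
from dataclasses import dataclass


@dataclass
class Usage:
    input_tokens: int
    output_tokens: int
    reasoning_tokens: int
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens
    total_runs: int
    pass_rate: float     # attempt pass rate as a fraction


def estimated_cost(u: Usage) -> float:
    """Estimate total cost in USD.

    ASSUMPTION (not stated in the source): reasoning tokens are billed
    at the output price and no caching discount applies.
    """
    billed_output = u.output_tokens + u.reasoning_tokens
    return (u.input_tokens * u.input_price + billed_output * u.output_price) / 1_000_000


def passing_runs(u: Usage) -> int:
    """Number of passing runs implied by the rounded attempt pass rate."""
    return round(u.total_runs * u.pass_rate)


# Values copied from the comparison table.
deepseek = Usage(108_392, 14_478, 153_687, 0.140, 0.280, 66, 0.727)
gpt_nano = Usage(82_819, 7_100, 90_022, 0.200, 1.250, 66, 0.652)

for name, usage in [("DeepSeek V4 Flash", deepseek), ("GPT-5.4 Nano", gpt_nano)]:
    print(f"{name}: ~${estimated_cost(usage):.3f}, {passing_runs(usage)} of {usage.total_runs} runs passed")
```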

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

Category:

Anti-AI Tricks	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	8.3	10.0	75.0%	0		28.51s	540	140	7,770
GPT-5.4 Nano	8.3	10.0	75.0%	0		4.52s	606	683	2,254

Coding	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	7.8	10.0	66.7%	0		50.60s	7,279	395	34,862
GPT-5.4 Nano	6.1	4.7	66.7%	2		19.12s	7,305	516	20,778

Combined	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	6.4	5.8	66.7%	1		104.10s	82,663	4,633	37,533
GPT-5.4 Nano	9.9	10.0	100.0%	0		32.24s	59,730	4,435	19,221

Data parsing and extraction	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	10.0	10.0	100.0%	0		28.03s	7,290	201	1,179
GPT-5.4 Nano	10.0	10.0	100.0%	0		2.54s	7,140	234	516

Domain specific	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	4.1	4.4	44.5%	2		100.31s	666	27	59,249
GPT-5.4 Nano	5.9	7.2	55.6%	1		38.18s	619	60	43,325

General Intelligence	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	6.1	3.1	66.7%	1		25.15s	471	79	632
GPT-5.4 Nano	4.5	10.0	0.0%	0		4.15s	477	179	443

Instructions following	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	10.0	10.0	100.0%	0		15.36s	627	63	1,622
GPT-5.4 Nano	9.8	10.0	100.0%	0		1.88s	660	95	521

Puzzle Solving	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	8.2	7.2	88.9%	1		26.11s	594	196	1,767
GPT-5.4 Nano	4.1	7.2	22.2%	1		3.79s	642	594	1,408

Tool Calling	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	10.0	10.0	100.0%	0		74.73s	8,079	228	542
GPT-5.4 Nano	10.0	10.0	100.0%	0		7.71s	5,445	234	382

Trivia	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	3.0	10.0	0.0%	0		54.46s	183	8,516	8,531
GPT-5.4 Nano	3.0	10.0	0.0%	0		4.81s	195	70	1,174
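
The per-category scores above can be summarized with a simple tally of which model scores higher in each category. This is a minimal sketch with hypothetical names (`category_scores`, `tally`); the numbers are the "Score" column values from the category tables, and the tally is not how the site derives its overall score.

```python
# (DeepSeek V4 Flash score, GPT-5.4 Nano score) per category, copied from the tables.
category_scores = {
    "Anti-AI Tricks": (8.3, 8.3),
    "Coding": (7.8, 6.1),
    "Combined": (6.4, 9.9),
    "Data parsing and extraction": (10.0, 10.0),
    "Domain specific": (4.1, 5.9),
    "General Intelligence": (6.1, 4.5),
    "Instructions following": (10.0, 9.8),
    "Puzzle Solving": (8.2, 4.1),
    "Tool Calling": (10.0, 10.0),
    "Trivia": (3.0, 3.0),
}


def tally(scores):
    """Count the categories each model wins outright, plus ties."""
    result = {"DeepSeek V4 Flash": 0, "GPT-5.4 Nano": 0, "tie": 0}
    for deepseek_score, gpt_score in scores.values():
        if deepseek_score > gpt_score:
            result["DeepSeek V4 Flash"] += 1
        elif gpt_score > deepseek_score:
            result["GPT-5.4 Nano"] += 1
        else:
            result["tie"] += 1
    return result


wins = tally(category_scores)
print(wins)  # {'DeepSeek V4 Flash': 4, 'GPT-5.4 Nano': 2, 'tie': 4}
```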
