DeepSeek: DeepSeek V4 Flash vs Google: Gemini 3.5 Flash

Gemini 3.5 Flash (high) leads on average score with 9.5 vs 7.7. DeepSeek V4 Flash (high) has the lower benchmark cost at $0.041 vs $1.976. Gemini 3.5 Flash (high) is faster at 15.07s vs 49.75s, with pass rates of 72.7% vs 93.9%.

Recommended modelGemini 3.5 Flash (high)It has the best score here (9.5), while responding about 3.3x faster than DeepSeek V4 Flash (high).

Last updated at: 2026-07-25

Metric	DeepSeek V4 Flash DeepSeek V4 Flash high Release: 2026-04-24	Gemini 3.5 Flash Gemini 3.5 Flash high Release: 2026-05-19

Metric	DeepSeek V4 Flash DeepSeek V4 Flash high Release: 2026-04-24	Gemini 3.5 Flash Gemini 3.5 Flash high Release: 2026-05-19
Score	7.7	9.5
Rank	#52	#4
Reliability	10.0	10.0
Consistency	8.2	9.3
Tests Correct
Attempt pass rate	72.7%	93.9%
Flaky tests	5	2
Total Runs	66	66
Cost per result	0.402	9.879
Total Cost	$0.041	$1.976
Input Price	$0.094 / 1M	$1.500 / 1M
Output Price	$0.188 / 1M	$9.000 / 1M
Total Input Tokens	108,392	107,137
Output Tokens	14,478	8,777
Reasoning Tokens	153,687	192,900
Response Time (avg)	49.75s	15.07s
Response Time (max)	218.13s	145.92s
Response Time (total)	1094.41s	331.48s

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#52 DeepSeek V4 Flash

high

Cost: $0.003
Time: 93.1s
Tokens: 7,926 tok

#4 Gemini 3.5 Flash

high

Cost: $0.208
Time: 118.2s
Tokens: 23,158 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Category:

Anti-AI Tricks	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	8.3	10.0	75.0%	0		28.51s	540	140	7,770
Gemini 3.5 Flash	10.0	10.0	100.0%	0		2.57s	492	174	4,997

Coding	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	7.8	10.0	66.7%	0		50.60s	7,279	395	34,862
Gemini 3.5 Flash	10.0	10.0	100.0%	0		22.96s	8,118	456	47,129

Combined	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	6.4	5.8	66.7%	1		104.10s	82,663	4,633	37,533
Gemini 3.5 Flash	8.2	6.9	66.7%	1		84.14s	82,416	7,153	93,585

Data parsing and extraction	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	10.0	10.0	100.0%	0		28.03s	7,290	201	1,179
Gemini 3.5 Flash	10.0	10.0	100.0%	0		6.43s	7,548	279	8,466

Domain specific	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	4.1	4.4	44.5%	2		100.31s	666	27	59,249
Gemini 3.5 Flash	7.6	7.2	77.8%	1		14.09s	633	12	24,721

General Intelligence	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	6.1	3.1	66.7%	1		25.15s	471	79	632
Gemini 3.5 Flash	10.0	10.0	100.0%	0		3.63s	486	115	1,650

Instructions following	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	10.0	10.0	100.0%	0		15.36s	627	63	1,622
Gemini 3.5 Flash	10.0	10.0	100.0%	0		3.35s	615	70	3,799

Puzzle Solving	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	8.2	7.2	88.9%	1		26.11s	594	196	1,767
Gemini 3.5 Flash	10.0	10.0	100.0%	0		3.23s	558	241	4,940

Tool Calling	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	10.0	10.0	100.0%	0		74.73s	8,079	228	542
Gemini 3.5 Flash	9.8	10.0	100.0%	0		4.96s	6,115	265	1,608

Trivia	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
DeepSeek V4 Flash	3.0	10.0	0.0%	0		54.46s	183	8,516	8,531
Gemini 3.5 Flash	10.0	10.0	100.0%	0		3.94s	156	12	2,005

Quick Compare

Switch Comparison Pair