Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

DeepSeek: DeepSeek V4 Flash vs Google: Gemini 3 Flash Preview

Last updated at: 2026-04-24

Metric DeepSeek V4 Flash DeepSeek V4 Flash high Release: 2026-04-24 Gemini 3 Flash Preview Gemini 3 Flash Preview medium Release: 2025-12-17
Score 7.8 10.0
Rank #35 #1
Consistency 7.8 10.0
Tests Correct
Attempt pass rate 79.6% 100.0%
Flaky tests 5 0
Total Runs 52 18
Cost per result 0.189 0.600
Total Cost $0.021 $0.108
Input Price $0.140 / 1M $0.500 / 1M
Output Price $0.280 / 1M $3.000 / 1M
Output Tokens 1,757 655
Reasoning Tokens 55,907 33,749
Response Time (avg) 47.47s 12.11s
Response Time (max) 255.28s 82.37s
Response Time (total) 854.45s 217.93s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Flash 8.3 10.0 75.0% 0 28.51s 140 7,770
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 3.26s 110 1,076
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Flash 10.0 10.0 100.0% 0 62.48s 369 9,361
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 82.37s 144 16,257
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Flash 10.0 10.0 100.0% 0 76.57s 465 7,347
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 23.58s 117 3,495
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Flash 10.0 10.0 100.0% 0 28.03s 201 1,179
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 7.62s 93 2,197
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Flash 4.1 4.4 44.5% 2 112.69s 19 24,857
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 14.81s 4 7,228
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Flash 6.1 3.1 66.7% 1 25.15s 79 632
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 6.34s 24 635
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Flash 10.0 10.0 100.0% 0 15.36s 63 1,622
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 4.30s 24 903
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Flash 6.4 4.5 77.8% 2 25.53s 193 2,597
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 4.86s 61 1,455
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Flash 10.0 10.0 100.0% 0 74.73s 228 542
Gemini 3 Flash Preview 10.0 10.0 100.0% 0 9.78s 78 503

Quick Compare

Switch Comparison Pair