Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

DeepSeek: DeepSeek V4 Pro vs OpenAI: GPT-5 Nano

Last updated at: 2026-06-01

Metric DeepSeek V4 Pro DeepSeek V4 Pro high Release: 2026-04-24 GPT-5 Nano GPT-5 Nano medium Release: 2025-08-07
Score 6.4 6.1
Rank #96 #100
Reliability 8.9 10.0
Consistency 8.7 7.1
Tests Correct
Attempt pass rate 55.0% 55.0%
Flaky tests 6 7
Total Runs 60 60
Cost per result 1.935 0.952
Total Cost $0.062 $0.077
Input Price $0.435 / 1M $0.050 / 1M
Output Price $0.870 / 1M $0.400 / 1M
Output Tokens 12,244 5,328
Reasoning Tokens 53,958 181,056
Response Time (avg) 58.92s 43.52s
Response Time (max) 358.35s 204.02s
Response Time (total) 1119.51s 565.82s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 7.0 10.0 58.3% 1 16.53s 71 3,617
GPT-5 Nano 6.5 7.9 58.3% 1 25.50s 1,221 21,184
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 2.6 5.0 16.7% 1 51.77s 105 2,641
GPT-5 Nano 5.4 6.6 33.3% 1 47.80s 604 30,144
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 10.0 10.0 100.0% 0 65.02s 465 5,914
GPT-5 Nano 10.0 10.0 100.0% 0 65.96s 578 17,984
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 8.3 10.0 83.3% 1 23.62s 229 1,710
GPT-5 Nano 3.7 1.7 50.0% 2 21.42s 453 10,560
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 2.9 6.9 11.1% 1 205.66s 10,529 28,089
GPT-5 Nano 5.2 4.4 55.6% 2 204.02s 237 64,448
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 6.1 3.1 66.7% 1 25.09s 76 1,152
GPT-5 Nano 4.1 10.0 0.0% 0 17.51s 202 4,608
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 10.0 10.0 100.0% 0 41.16s 205 2,416
GPT-5 Nano 9.8 10.0 100.0% 0 15.64s 312 4,736
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 6.6 10.0 55.6% 1 34.84s 139 4,019
GPT-5 Nano 5.3 7.2 44.4% 1 20.63s 929 14,272
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 10.0 10.0 100.0% 0 21.33s 372 593
GPT-5 Nano 10.0 10.0 100.0% 0 33.30s 558 6,976
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
DeepSeek V4 Pro 3.0 10.0 0.0% 0 39.14s 53 3,807
GPT-5 Nano 3.0 10.0 0.0% 0 20.13s 234 6,144

Quick Compare

Switch Comparison Pair