Urambazaji
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Anthropic: Claude Opus 4.8 vs DeepSeek: DeepSeek V4 Flash

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-05-28

Kipimo Claude Opus 4.8 Claude Opus 4.8 medium Toleo: 2026-05-28 DeepSeek V4 Flash DeepSeek V4 Flash high Toleo: 2026-04-24 Inapatikana bure
Alama 8.7 7.6
Nafasi #12 #45
Uaminifu 10.0 10.0
Uthabiti 9.6 8.4
Majaribio sahihi
Kiwango cha kupita kwa kila jaribio 83.3% 73.3%
Majaribio yasiyo thabiti 1 4
Jumla ya uendeshaji 60 60
Gharama kwa matokeo 6.285 0.309
Jumla ya gharama $1.006 $0.028
Bei ya ingizo $5.000 / 1M $0.100 / 1M
Bei ya toleo $25.000 / 1M $0.200 / 1M
Tokeni za matokeo 23,201 10,302
Tokeni za hoja 5,901 115,740
Muda wa majibu (wastani) 9.34s 46.36s
Muda wa majibu (upeo) 38.03s 218.13s
Muda wa majibu (jumla) 186.84s 927.27s

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Muda wa majibu (wastani)

Alama vs Muda wa majibu (wastani)

Jumla ya tokeni za matokeo

Alama vs Jumla ya tokeni za matokeo

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Claude Opus 4.8 10.0 10.0 100.0% 0 3.95s 1,179 478
DeepSeek V4 Flash 8.3 10.0 75.0% 0 28.51s 140 7,770
Uandishi wa msimbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Claude Opus 4.8 10.0 10.0 100.0% 0 14.97s 6,651 1,381
DeepSeek V4 Flash 6.8 10.0 50.0% 0 58.13s 387 27,101
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Claude Opus 4.8 9.8 10.0 100.0% 0 38.03s 5,260 1,588
DeepSeek V4 Flash 10.0 10.0 100.0% 0 76.57s 465 7,347
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Claude Opus 4.8 7.1 5.6 83.3% 1 12.29s 481 312
DeepSeek V4 Flash 10.0 10.0 100.0% 0 28.03s 201 1,179
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Claude Opus 4.8 5.3 10.0 33.3% 0 14.15s 7,477 900
DeepSeek V4 Flash 4.1 4.4 44.5% 2 100.31s 27 59,249
Akili ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Claude Opus 4.8 10.0 10.0 100.0% 0 2.46s 237 0
DeepSeek V4 Flash 6.1 3.1 66.7% 1 25.15s 79 632
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Claude Opus 4.8 10.0 10.0 100.0% 0 3.32s 373 320
DeepSeek V4 Flash 10.0 10.0 100.0% 0 15.36s 63 1,622
Utatuzi wa mafumbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Claude Opus 4.8 10.0 10.0 100.0% 0 3.95s 791 483
DeepSeek V4 Flash 8.2 7.2 88.9% 1 26.11s 196 1,767
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Claude Opus 4.8 10.0 10.0 100.0% 0 8.96s 301 225
DeepSeek V4 Flash 10.0 10.0 100.0% 0 74.73s 228 542
Maarifa ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Claude Opus 4.8 3.0 10.0 0.0% 0 6.14s 451 214
DeepSeek V4 Flash 3.0 10.0 0.0% 0 54.46s 8,516 8,531

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho