Urambazaji
AI BENCHY
Advertise here

AI BENCHY Compare

DeepSeek: DeepSeek V4 Flash vs xAI: Grok Build 0.1

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-05-21

Kipimo DeepSeek V4 Flash DeepSeek V4 Flash high Toleo: 2026-04-24 Inapatikana bure Grok Build 0.1 Grok Build 0.1 medium Toleo: 2026-05-21
Alama 7.6 7.8
Nafasi #54 #41
Uaminifu 10.0 10.0
Uthabiti 7.9 8.9
Majaribio sahihi
Kiwango cha kupita kwa kila jaribio 75.4% 71.9%
Majaribio yasiyo thabiti 5 3
Jumla ya uendeshaji 57 57
Gharama kwa matokeo 0.299 4.064
Jumla ya gharama $0.033 $0.488
Bei ya ingizo $0.112 / 1M $1.000 / 1M
Bei ya toleo $0.224 / 1M $2.000 / 1M
Tokeni za matokeo 10,281 1,947
Tokeni za hoja 98,830 223,372
Muda wa majibu (wastani) 45.88s 22.28s
Muda wa majibu (upeo) 218.13s 88.28s
Muda wa majibu (jumla) 871.76s 423.30s

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Muda wa majibu (wastani)

Alama vs Muda wa majibu (wastani)

Jumla ya tokeni za matokeo

Alama vs Jumla ya tokeni za matokeo

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
DeepSeek V4 Flash 8.3 10.0 75.0% 0 28.51s 140 7,770
Grok Build 0.1 10.0 10.0 100.0% 0 5.46s 195 9,825
Uandishi wa msimbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
DeepSeek V4 Flash 10.0 10.0 100.0% 0 62.48s 369 9,361
Grok Build 0.1 7.3 3.7 66.7% 1 30.98s 354 17,734
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
DeepSeek V4 Flash 10.0 10.0 100.0% 0 76.57s 465 7,347
Grok Build 0.1 10.0 10.0 100.0% 0 30.81s 231 18,779
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
DeepSeek V4 Flash 10.0 10.0 100.0% 0 28.03s 201 1,179
Grok Build 0.1 10.0 10.0 100.0% 0 7.76s 180 10,343
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
DeepSeek V4 Flash 4.1 4.4 44.5% 2 100.31s 27 59,249
Grok Build 0.1 5.3 10.0 33.3% 0 77.75s 501 111,807
Akili ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
DeepSeek V4 Flash 6.1 3.1 66.7% 1 25.15s 79 632
Grok Build 0.1 3.8 2.5 33.3% 1 10.14s 78 5,386
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
DeepSeek V4 Flash 10.0 10.0 100.0% 0 15.36s 63 1,622
Grok Build 0.1 9.8 10.0 100.0% 0 9.62s 57 12,436
Utatuzi wa mafumbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
DeepSeek V4 Flash 6.4 4.4 77.8% 2 25.53s 193 2,597
Grok Build 0.1 6.2 7.5 55.6% 1 8.67s 161 15,476
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
DeepSeek V4 Flash 10.0 10.0 100.0% 0 74.73s 228 542
Grok Build 0.1 10.0 10.0 100.0% 0 9.40s 180 5,319
Maarifa ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
DeepSeek V4 Flash 3.0 10.0 0.0% 0 54.46s 8,516 8,531
Grok Build 0.1 3.0 10.0 0.0% 0 26.07s 10 16,267

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho