AI BENCHY Compare

OpenAI: GPT-5.4 vs xAI: Grok 4.3

Benchmarks generated from the AI BENCHY test suites on 2026-05-22.

| Metric | GPT-5.4 (GPT-5.4 medium, released 2026-03-05) | Grok 4.3 (Grok 4.3 medium, released 2026-05-01) |
|---|---|---|
| Score | 7.9 | 7.8 |
| Rank | #27 | #31 |
| Reliability | 10.0 | 10.0 |
| Consistency | 8.5 | 8.4 |
| Correct tests | | |
| Per-test pass rate | 75.0% | 75.0% |
| Flaky tests | 4 | 4 |
| Total runs | 60 | 60 |
| Cost per result | 8.767 | 4.562 |
| Total cost | $1.140 | $0.593 |
| Input price (per 1M tokens) | $2.500 | $1.250 |
| Output price (per 1M tokens) | $15.000 | $2.500 |
| Output tokens | 2,222 | 1,485 |
| Reasoning tokens | 68,503 | 214,928 |
| Response time (average) | 22.31s | 49.23s |
| Response time (max) | 100.41s | 216.69s |
| Response time (total) | 446.15s | 984.54s |
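
The summary figures above lend themselves to a few quick derived numbers. The sketch below is illustrative only and is not part of AI BENCHY: the dictionary keys and function names are hypothetical, the test count is inferred from total time divided by average time, and the output-side cost assumes that reasoning tokens are billed at the output price, which the page does not state.

```python
# Figures copied from the summary table; key names are illustrative.
summary = {
    "GPT-5.4": {
        "total_cost": 1.140, "avg_s": 22.31, "total_s": 446.15,
        "output_tokens": 2_222, "reasoning_tokens": 68_503,
        "output_price_per_m": 15.000,
    },
    "Grok 4.3": {
        "total_cost": 0.593, "avg_s": 49.23, "total_s": 984.54,
        "output_tokens": 1_485, "reasoning_tokens": 214_928,
        "output_price_per_m": 2.500,
    },
}


def implied_test_count(row):
    """Infer how many timed tests the average covers (total / average)."""
    return round(row["total_s"] / row["avg_s"])


def output_side_cost(row):
    """Estimate the cost of generated tokens only.

    Assumption: reasoning tokens are billed at the output price.
    Input-token cost is excluded because the page gives no input token count.
    """
    tokens = row["output_tokens"] + row["reasoning_tokens"]
    return tokens * row["output_price_per_m"] / 1_000_000


gpt, grok = summary["GPT-5.4"], summary["Grok 4.3"]
cost_ratio = grok["total_cost"] / gpt["total_cost"]   # Grok cost relative to GPT
latency_ratio = grok["avg_s"] / gpt["avg_s"]          # Grok latency relative to GPT

for name, row in summary.items():
    print(name, implied_test_count(row), round(output_side_cost(row), 3))
print(round(cost_ratio, 2), round(latency_ratio, 2))
```

Under these assumptions, both models work out to the same implied test count, and the output-side estimate stays below the reported total cost for each model, leaving the remainder for input tokens.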

Charts (not reproduced here): Top models by score; Score vs. total cost; Response time (average); Score vs. response time (average); Total output tokens; Score vs. total output tokens.

Category breakdown

| Category | Model | Score | Consistency | Per-test pass rate | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
|---|---|---|---|---|---|---|---|---|---|
| Anti-AI tricks | GPT-5.4 | 8.3 | 10.0 | 75.0% | 0 | | 4.11s | 240 | 1,511 |
| Anti-AI tricks | Grok 4.3 | 10.0 | 10.0 | 100.0% | 0 | | 8.83s | 88 | 8,207 |
| Coding | GPT-5.4 | 8.2 | 6.7 | 83.3% | 1 | | 54.98s | 412 | 19,995 |
| Coding | Grok 4.3 | 7.4 | 6.5 | 66.7% | 1 | | 55.26s | 532 | 24,554 |
| Combined | GPT-5.4 | 10.0 | 10.0 | 100.0% | 0 | | 20.57s | 301 | 3,543 |
| Combined | Grok 4.3 | 10.0 | 10.0 | 100.0% | 0 | | 63.99s | 234 | 15,301 |
| Data parsing and extraction | GPT-5.4 | 10.0 | 10.0 | 100.0% | 0 | | 5.32s | 234 | 804 |
| Data parsing and extraction | Grok 4.3 | 10.0 | 10.0 | 100.0% | 0 | | 18.97s | 180 | 9,546 |
| Domain-specific | GPT-5.4 | 5.3 | 7.2 | 44.4% | 1 | | 74.27s | 61 | 34,748 |
| Domain-specific | Grok 4.3 | 5.3 | 7.2 | 44.4% | 1 | | 181.74s | 14 | 111,300 |
| General intelligence | GPT-5.4 | 4.7 | 3.1 | 33.3% | 1 | | 4.92s | 145 | 321 |
| General intelligence | Grok 4.3 | 5.4 | 2.5 | 66.7% | 1 | | 24.70s | 70 | 5,020 |
| Instruction following | GPT-5.4 | 10.0 | 10.0 | 100.0% | 0 | | 3.11s | 93 | 897 |
| Instruction following | Grok 4.3 | 9.8 | 10.0 | 100.0% | 0 | | 18.58s | 57 | 8,713 |
| Puzzle solving | GPT-5.4 | 8.2 | 7.2 | 88.9% | 1 | | 9.13s | 442 | 3,832 |
| Puzzle solving | Grok 4.3 | 5.9 | 7.2 | 55.6% | 1 | | 22.53s | 128 | 14,686 |
| Tool calling | GPT-5.4 | 10.0 | 10.0 | 100.0% | 0 | | 13.28s | 264 | 1,031 |
| Tool calling | Grok 4.3 | 10.0 | 10.0 | 100.0% | 0 | | 17.66s | 168 | 4,615 |
| General knowledge | GPT-5.4 | 3.0 | 10.0 | 0.0% | 0 | | 13.95s | 30 | 1,821 |
| General knowledge | Grok 4.3 | 3.0 | 10.0 | 0.0% | 0 | | 44.47s | 14 | 12,986 |
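
As a consistency check, the per-category rows above can be summed and compared against the headline totals for flaky tests, output tokens, and reasoning tokens. This is a minimal sketch with hypothetical names (`rows`, `headline`, `category_totals`). The English category labels are translations used for readability, and the sketch assumes the headline figures are plain sums over categories, which the page does not state explicitly.

```python
# (category, model, flaky_tests, output_tokens, reasoning_tokens),
# copied from the category rows above.
rows = [
    ("Anti-AI tricks", "GPT-5.4", 0, 240, 1_511),
    ("Anti-AI tricks", "Grok 4.3", 0, 88, 8_207),
    ("Coding", "GPT-5.4", 1, 412, 19_995),
    ("Coding", "Grok 4.3", 1, 532, 24_554),
    ("Combined", "GPT-5.4", 0, 301, 3_543),
    ("Combined", "Grok 4.3", 0, 234, 15_301),
    ("Data parsing and extraction", "GPT-5.4", 0, 234, 804),
    ("Data parsing and extraction", "Grok 4.3", 0, 180, 9_546),
    ("Domain-specific", "GPT-5.4", 1, 61, 34_748),
    ("Domain-specific", "Grok 4.3", 1, 14, 111_300),
    ("General intelligence", "GPT-5.4", 1, 145, 321),
    ("General intelligence", "Grok 4.3", 1, 70, 5_020),
    ("Instruction following", "GPT-5.4", 0, 93, 897),
    ("Instruction following", "Grok 4.3", 0, 57, 8_713),
    ("Puzzle solving", "GPT-5.4", 1, 442, 3_832),
    ("Puzzle solving", "Grok 4.3", 1, 128, 14_686),
    ("Tool calling", "GPT-5.4", 0, 264, 1_031),
    ("Tool calling", "Grok 4.3", 0, 168, 4_615),
    ("General knowledge", "GPT-5.4", 0, 30, 1_821),
    ("General knowledge", "Grok 4.3", 0, 14, 12_986),
]

# Headline (flaky_tests, output_tokens, reasoning_tokens) per model.
headline = {
    "GPT-5.4": (4, 2_222, 68_503),
    "Grok 4.3": (4, 1_485, 214_928),
}


def category_totals(model):
    """Sum flaky tests, output tokens, and reasoning tokens over all categories."""
    picked = [r for r in rows if r[1] == model]
    return tuple(sum(r[i] for r in picked) for i in (2, 3, 4))


for model, expected in headline.items():
    totals = category_totals(model)
    print(model, totals, "matches headline:", totals == expected)
```

With the figures on this page, the category sums agree with the headline totals for both models.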
