Urambazaji
AI BENCHY
Advertise here

AI BENCHY Compare

OpenAI: GPT-5.5 vs Grok 4.20 Multi Agent Beta

Muhtasari

Ulinganisho wa benchmark GPT-5.5 vs Grok 4.20 Multi Agent Beta: GPT-5.5 inaongoza kwa average score: 9.3 vs 5.0. GPT-5.5 ina gharama ya chini ya benchmark: $0.907 vs $5.599. Grok 4.20 Multi Agent Beta ni ya haraka zaidi: 9.69s vs 9.76s, na pass rates 85.7% vs 50.8%.

Muundo unaopendekezwa: GPT-5.5 - It has the best score here (9.3), while costing about 6.2x less than Grok 4.20 Multi Agent Beta.

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-06-18

Kipimo GPT-5.5 GPT-5.5 low Toleo: 2026-04-24 Grok 4.20 Multi Agent Beta Grok 4.20 Multi Agent Beta medium Toleo: 2026-03-12
Alama 9.3 5.0
Nafasi #4 #136
Uaminifu 10.0 Haipo
Uthabiti 10.0 6.7
Majaribio sahihi
Kiwango cha kupita kwa kila jaribio 85.7% 50.8%
Majaribio yasiyo thabiti 0 5
Jumla ya uendeshaji 63 52
Gharama kwa matokeo 5.035 62.923
Jumla ya gharama $0.907 $5.599
Bei ya ingizo $5.000 / 1M $4.235 / 1M
Bei ya toleo $30.000 / 1M $4.235 / 1M
Jumla ya tokeni za ingizo 34,209 721,952
Tokeni za matokeo 2,046 294,668
Tokeni za hoja 22,460 305,374
Muda wa majibu (wastani) 9.76s 9.69s
Muda wa majibu (upeo) 56.19s 35.28s
Muda wa majibu (jumla) 204.92s 155.07s

Onyesho la kizazi

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#4 GPT-5.5

low
Gharama
$0.068
Muda
37.0s
Tokeni
2,339 tok

#136 Grok 4.20 Multi Agent Beta

medium
Gharama
$0.261
Muda
123.4s
Tokeni
199,344 tok

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Muda wa majibu (wastani)

Alama vs Muda wa majibu (wastani)

Jumla ya tokeni za matokeo

Alama vs Jumla ya tokeni za matokeo

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
GPT-5.5 10.0 10.0 100.0% 0 4.41s 606 238 1,020
Grok 4.20 Multi Agent Beta 6.9 5.8 75.0% 2 3.46s 90,925 33,706 33,077
Uandishi wa msimbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
GPT-5.5 10.0 10.0 100.0% 0 15.04s 7,302 423 6,402
Grok 4.20 Multi Agent Beta 3.3 3.3 33.3% 0 27.11s 13,212 86 13,141
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
GPT-5.5 10.0 10.0 100.0% 0 9.56s 11,019 303 717
Grok 4.20 Multi Agent Beta 3.0 10.0 0.0% 0 0ms 0 0 0
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
GPT-5.5 10.0 10.0 100.0% 0 3.28s 7,140 228 157
Grok 4.20 Multi Agent Beta 10.0 10.0 100.0% 0 5.54s 97,232 25,306 25,051
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
GPT-5.5 5.3 10.0 33.3% 0 28.05s 723 69 11,609
Grok 4.20 Multi Agent Beta 2.9 7.2 11.1% 1 24.67s 328,253 164,609 163,647
Akili ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
GPT-5.5 10.0 10.0 100.0% 0 5.17s 477 133 245
Grok 4.20 Multi Agent Beta 5.8 2.8 66.7% 1 6.40s 41,387 15,848 15,746
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
GPT-5.5 9.9 10.0 100.0% 0 3.74s 660 93 415
Grok 4.20 Multi Agent Beta 9.8 10.0 100.0% 0 3.52s 43,923 19,752 19,617
Utatuzi wa mafumbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
GPT-5.5 10.0 10.0 100.0% 0 4.74s 642 279 954
Grok 4.20 Multi Agent Beta 6.7 7.9 55.6% 1 5.19s 107,020 35,361 35,095
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
GPT-5.5 10.0 10.0 100.0% 0 4.96s 5,445 250 101
Grok 4.20 Multi Agent Beta 3.0 10.0 0.0% 0 0ms 0 0 0
Maarifa ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
GPT-5.5 3.0 10.0 0.0% 0 10.06s 195 30 840
Grok 4.20 Multi Agent Beta 0.0 0.0 0.0% 0 0ms 0 0 0

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho