Urambazaji
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Qwen: Qwen3.5-27B vs xAI: Grok 4.3

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-05-01

Kipimo Qwen3.5-27B Qwen3.5-27B medium Toleo: 2026-02-24 Grok 4.3 Grok 4.3 medium Toleo: 2026-05-01
Alama 8.4 8.2
Nafasi #16 #20
Uaminifu Haipo 10.0
Uthabiti 8.8 8.6
Majaribio sahihi
Kiwango cha kupita kwa kila jaribio 81.5% 81.5%
Majaribio yasiyo thabiti 3 3
Jumla ya uendeshaji 54 54
Gharama kwa matokeo 3.822 3.974
Jumla ya gharama $0.497 $0.517
Bei ya ingizo $0.195 / 1M $1.250 / 1M
Bei ya toleo $1.560 / 1M $2.500 / 1M
Tokeni za matokeo 2,500 1,223
Tokeni za hoja 242,500 187,047
Muda wa majibu (wastani) 53.03s 48.63s
Muda wa majibu (upeo) 163.96s 216.69s
Muda wa majibu (jumla) 954.46s 875.27s

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Muda wa majibu (wastani)

Alama vs Muda wa majibu (wastani)

Jumla ya tokeni za matokeo

Alama vs Jumla ya tokeni za matokeo

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5-27B 8.7 7.9 91.7% 1 19.75s 569 31,505
Grok 4.3 10.0 10.0 100.0% 0 8.83s 88 8,207
Uandishi wa msimbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5-27B 10.0 10.0 100.0% 0 70.35s 375 19,165
Grok 4.3 10.0 10.0 100.0% 0 45.72s 284 9,659
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5-27B 10.0 10.0 100.0% 0 163.96s 483 9,991
Grok 4.3 10.0 10.0 100.0% 0 63.99s 234 15,301
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5-27B 10.0 10.0 100.0% 0 30.26s 270 16,150
Grok 4.3 10.0 10.0 100.0% 0 18.97s 180 9,546
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5-27B 5.3 10.0 33.3% 0 79.53s 43 52,368
Grok 4.3 5.3 7.2 44.4% 1 181.74s 14 111,300
Akili ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5-27B 6.1 3.1 66.7% 1 101.41s 70 23,147
Grok 4.3 5.4 2.5 66.7% 1 24.70s 70 5,020
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5-27B 10.0 10.0 100.0% 0 19.66s 97 11,638
Grok 4.3 9.8 10.0 100.0% 0 18.58s 57 8,713
Utatuzi wa mafumbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5-27B 8.2 7.7 77.8% 1 64.61s 245 77,213
Grok 4.3 5.9 7.2 55.6% 1 22.53s 128 14,686
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5-27B 10.0 10.0 100.0% 0 7.45s 348 1,323
Grok 4.3 10.0 10.0 100.0% 0 17.66s 168 4,615

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho