AI BENCHY Compare

Qwen: Qwen3.6 Max Preview vs xAI: Grok Build 0.1

Benchmarks generated from the AI BENCHY test sets on 2026-05-21

| Metric | Qwen3.6 Max Preview (none; release: 2026-04-20) | Grok Build 0.1 (medium; release: 2026-05-21) |
| --- | --- | --- |
| Score | 7.2 | 7.8 |
| Rank | #60 | #41 |
| Reliability | 10.0 | 10.0 |
| Consistency | 9.1 | 8.9 |
| Correct tests | | |
| Pass rate per test | 64.9% | 71.9% |
| Flaky tests | 2 | 3 |
| Total runs | 57 | 57 |
| Cost per result | 0.755 | 4.064 |
| Total cost | $0.083 | $0.488 |
| Input price | $1.040 / 1M | $1.000 / 1M |
| Output price | $6.240 / 1M | $2.000 / 1M |
| Output tokens | 4,751 | 1,947 |
| Reasoning tokens | 0 | 223,372 |
| Response time (average) | 3.31s | 22.28s |
| Response time (max) | 20.51s | 88.28s |
| Response time (total) | 62.80s | 423.30s |
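
The headline figures can be cross-checked against each other with a little arithmetic. The sketch below is illustrative only: it assumes the pass rate is a fraction of total runs and that total response time equals the average response time multiplied by the number of tests. The page states neither definition, and the names `HEADLINE`, `implied_passing_runs`, `implied_test_count`, and `ratio` are hypothetical helpers, not part of AI BENCHY.

```python
# Headline figures copied from the comparison table; nothing here is new data.
HEADLINE = {
    "Qwen3.6 Max Preview": {
        "total_runs": 57, "pass_rate": 0.649,
        "avg_time_s": 3.31, "total_time_s": 62.80, "total_cost_usd": 0.083,
    },
    "Grok Build 0.1": {
        "total_runs": 57, "pass_rate": 0.719,
        "avg_time_s": 22.28, "total_time_s": 423.30, "total_cost_usd": 0.488,
    },
}


def implied_passing_runs(m):
    """Assumption: the pass rate is measured as a fraction of total runs."""
    return round(m["pass_rate"] * m["total_runs"])


def implied_test_count(m):
    """Assumption: total time = average time x number of tests."""
    return round(m["total_time_s"] / m["avg_time_s"])


def ratio(metric, a, b):
    """How many times larger model a's metric is than model b's."""
    return HEADLINE[a][metric] / HEADLINE[b][metric]


if __name__ == "__main__":
    for name, m in HEADLINE.items():
        print(name, implied_passing_runs(m), implied_test_count(m))
    grok, qwen = "Grok Build 0.1", "Qwen3.6 Max Preview"
    print("cost ratio:", round(ratio("total_cost_usd", grok, qwen), 2))
    print("latency ratio:", round(ratio("avg_time_s", grok, qwen), 2))
```

Under these assumptions, the figures for both models are consistent with 19 tests run 3 times each (57 runs), with 37 passing runs for Qwen3.6 Max Preview and 41 for Grok Build 0.1.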

[Charts: Top models by score; Score vs. total cost; Response time (average); Score vs. response time (average); Total output tokens; Score vs. total output tokens]

Category breakdown

Anti-AI techniques

| Model | Score | Consistency | Pass rate per test | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 5.2 | 7.9 | 41.7% | 1 | | 2.63s | 513 | 0 |
| Grok Build 0.1 | 10.0 | 10.0 | 100.0% | 0 | | 5.46s | 195 | 9,825 |

Coding

| Model | Score | Consistency | Pass rate per test | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 5.0 | 2.0 | 66.7% | 1 | | 3.45s | 426 | 0 |
| Grok Build 0.1 | 7.3 | 3.7 | 66.7% | 1 | | 30.98s | 354 | 17,734 |

Mixed

| Model | Score | Consistency | Pass rate per test | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 3.0 | 10.0 | 0.0% | 0 | | 20.51s | 2,842 | 0 |
| Grok Build 0.1 | 10.0 | 10.0 | 100.0% | 0 | | 30.81s | 231 | 18,779 |

Data parsing and extraction

| Model | Score | Consistency | Pass rate per test | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 2.87s | 243 | 0 |
| Grok Build 0.1 | 10.0 | 10.0 | 100.0% | 0 | | 7.76s | 180 | 10,343 |

Domain-specific

| Model | Score | Consistency | Pass rate per test | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 7.7 | 10.0 | 66.7% | 0 | | 1.22s | 18 | 0 |
| Grok Build 0.1 | 5.3 | 10.0 | 33.3% | 0 | | 77.75s | 501 | 111,807 |

General intelligence

| Model | Score | Consistency | Pass rate per test | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 4.3 | 10.0 | 0.0% | 0 | | 1.62s | 76 | 0 |
| Grok Build 0.1 | 3.8 | 2.5 | 33.3% | 1 | | 10.14s | 78 | 5,386 |

Instruction following

| Model | Score | Consistency | Pass rate per test | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 9.8 | 10.0 | 100.0% | 0 | | 1.45s | 69 | 0 |
| Grok Build 0.1 | 9.8 | 10.0 | 100.0% | 0 | | 9.62s | 57 | 12,436 |

Puzzle solving

| Model | Score | Consistency | Pass rate per test | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 2.38s | 323 | 0 |
| Grok Build 0.1 | 6.2 | 7.5 | 55.6% | 1 | | 8.67s | 161 | 15,476 |

Tool calling

| Model | Score | Consistency | Pass rate per test | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 10.0 | 10.0 | 100.0% | 0 | | 5.27s | 222 | 0 |
| Grok Build 0.1 | 10.0 | 10.0 | 100.0% | 0 | | 9.40s | 180 | 5,319 |

General knowledge

| Model | Score | Consistency | Pass rate per test | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.6 Max Preview | 3.0 | 10.0 | 0.0% | 0 | | 1.97s | 19 | 0 |
| Grok Build 0.1 | 3.0 | 10.0 | 0.0% | 0 | | 26.07s | 10 | 16,267 |
