Urambazaji
AI BENCHY
Linganisha Chati Mbinu
❤️ Made by XCS
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

OpenAI: GPT-5.4 vs Qwen: Qwen3.5-35B-A3B

Linganisha:

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-03-06

Kipimo OpenAI: GPT-5.4 none Toleo: 2026-03-05 Qwen: Qwen3.5-35B-A3B medium Toleo: 2026-02-24
Wastani wa alama 4.6 5.8
Nafasi #45 #34
Majaribio sahihi
Uthabiti 8.9 6.7
Gharama kwa matokeo 1.496 4.189
Jumla ya gharama $0.090 $0.336
Kiwango cha kupita kwa kila jaribio 44.4% 80.0%
Majaribio yasiyo thabiti 2 6
common.totalRuns 45 (15 x 3) 45 (15 x 3)
Tokeni za matokeo 1,635 5,475
Tokeni za hoja 0 165,513
Muda wa majibu (wastani) 1.46s 44.84s
Muda wa majibu (upeo) 2.89s 106.00s
Muda wa majibu (jumla) 21.86s 672.55s

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Muda wa majibu (wastani)

Wastani wa alama vs Muda wa majibu (wastani)

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 10.0 7.3 11.1% 1 1.41s 388 0
Qwen: Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 21.75s 429 36,235
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 10.0 10.0 0.0% 0 2.89s 291 0
Qwen: Qwen3.5-35B-A3B 10.0 1.6 66.7% 1 75.34s 775 12,485
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 9.9 10.0 100.0% 0 1.04s 222 0
Qwen: Qwen3.5-35B-A3B 5.5 5.9 83.3% 1 59.33s 235 19,493
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 4.0 7.2 44.4% 1 1.07s 50 0
Qwen: Qwen3.5-35B-A3B 10.0 4.4 44.5% 2 88.34s 41 46,368
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 5.5 10.0 50.0% 0 1.07s 81 0
Qwen: Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 24.45s 97 17,361
Puzzle Solving Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 4.0 9.8 33.3% 0 1.52s 357 0
Qwen: Qwen3.5-35B-A3B 4.0 4.4 77.8% 2 31.58s 3,589 32,206
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 10.0 10.0 100.0% 0 2.75s 246 0
Qwen: Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 4.65s 309 1,365

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho