Urambazaji
AI BENCHY
Linganisha Chati
❤️ Made by XCS
Your ad here

AI BENCHY Compare

OpenAI: GPT-5.4 vs Qwen: Qwen3.5-27B

Linganisha:

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-03-05

Kipimo OpenAI: GPT-5.4 none Toleo: 2026-03-05 Qwen: Qwen3.5-27B none Toleo: 2026-02-24
Nafasi #44 #41
Wastani wa alama 4.6 4.9
Majaribio sahihi
Uthabiti 8.9 9.0
Gharama kwa matokeo 1.496 0.297
Jumla ya gharama $0.090 $0.015
Kiwango cha kupita kwa kila jaribio 44.4% 40.0%
Majaribio yasiyo thabiti 2 2
common.totalAttempts 45 (15 x 3) 45 (15 x 3)
Tokeni za matokeo 1,635 3,035
Tokeni za hoja 0 0
Muda wa majibu (wastani) 1.46s 1.70s
Muda wa majibu (upeo) 2.89s 9.39s
Muda wa majibu (jumla) 21.86s 25.55s

Modeli bora kwa alama

Muda wa majibu (wastani)

Alama dhidi ya gharama ya jumla

Wastani wa alama vs Muda wa majibu (wastani)

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 10.0 7.3 11.1% 1 1.41s 388 0
Qwen: Qwen3.5-27B 4.0 10.0 33.3% 0 796ms 264 0
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 10.0 10.0 0.0% 0 2.89s 291 0
Qwen: Qwen3.5-27B 10.0 1.6 33.3% 1 9.39s 1,461 0
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 9.9 10.0 100.0% 0 1.04s 222 0
Qwen: Qwen3.5-27B 9.9 10.0 100.0% 0 1.43s 243 0
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 4.0 7.2 44.4% 1 1.07s 50 0
Qwen: Qwen3.5-27B 10.0 10.0 0.0% 0 540ms 15 0
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 5.5 10.0 50.0% 0 1.07s 81 0
Qwen: Qwen3.5-27B 4.5 10.0 0.0% 0 815ms 69 0
Puzzle Solving Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 4.0 9.8 33.3% 0 1.52s 357 0
Qwen: Qwen3.5-27B 6.3 7.9 55.6% 1 1.37s 680 0
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 10.0 10.0 100.0% 0 2.75s 246 0
Qwen: Qwen3.5-27B 10.0 10.0 100.0% 0 3.54s 303 0

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho