Urambazaji
AI BENCHY
Advertise here

AI BENCHY Compare

Modeli zilizolinganishwa

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-05-22

Kipimo Qwen3.5 Plus 2026-02-15 Qwen3.5 Plus 2026-02-15 medium Toleo: 2026-02-15 Qwen3.5-27B Qwen3.5-27B medium Toleo: 2026-02-24 GPT-5.4 GPT-5.4 medium Toleo: 2026-03-05
Alama 8.1 7.9 7.9
Nafasi #20 #25 #27
Uaminifu 10.0 10.0 10.0
Uthabiti 8.8 8.9 8.5
Majaribio sahihi
Kiwango cha kupita kwa kila jaribio 76.7% 73.3% 75.0%
Majaribio yasiyo thabiti 3 3 4
Jumla ya uendeshaji 60 60 60
Gharama kwa matokeo 2.259 4.664 8.767
Jumla ya gharama $0.317 $0.607 $1.140
Bei ya ingizo $0.260 / 1M $0.195 / 1M $2.500 / 1M
Bei ya toleo $1.560 / 1M $1.560 / 1M $15.000 / 1M
Tokeni za matokeo 2,184 2,572 2,222
Tokeni za hoja 173,206 312,011 68,503
Muda wa majibu (wastani) 67.90s 60.85s 22.31s
Muda wa majibu (upeo) 266.69s 177.36s 100.41s
Muda wa majibu (jumla) 882.70s 1216.93s 446.15s

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Muda wa majibu (wastani)

Alama vs Muda wa majibu (wastani)

Jumla ya tokeni za matokeo

Alama vs Jumla ya tokeni za matokeo

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5 Plus 2026-02-15 8.2 7.9 83.3% 1 45.78s 205 21,236
Qwen3.5-27B 8.7 7.9 91.7% 1 19.75s 569 31,505
GPT-5.4 8.3 10.0 75.0% 0 4.11s 240 1,511
Uandishi wa msimbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5 Plus 2026-02-15 7.6 6.7 66.7% 1 193.80s 406 63,554
Qwen3.5-27B 7.0 9.8 50.0% 0 123.86s 416 64,993
GPT-5.4 8.2 6.7 83.3% 1 54.98s 412 19,995
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5 Plus 2026-02-15 10.0 10.0 100.0% 0 46.85s 421 7,906
Qwen3.5-27B 10.0 10.0 100.0% 0 163.96s 483 9,991
GPT-5.4 10.0 10.0 100.0% 0 20.57s 301 3,543
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5 Plus 2026-02-15 10.0 10.0 100.0% 0 46.91s 270 14,916
Qwen3.5-27B 10.0 10.0 100.0% 0 30.26s 270 16,150
GPT-5.4 10.0 10.0 100.0% 0 5.32s 234 804
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5 Plus 2026-02-15 5.3 10.0 33.3% 0 17.50s 35 16,680
Qwen3.5-27B 5.3 10.0 33.3% 0 79.53s 43 52,368
GPT-5.4 5.3 7.2 44.4% 1 74.27s 61 34,748
Akili ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5 Plus 2026-02-15 4.7 1.6 66.7% 1 79.86s 73 8,675
Qwen3.5-27B 6.1 3.1 66.7% 1 101.41s 70 23,147
GPT-5.4 4.7 3.1 33.3% 1 4.92s 145 321
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5 Plus 2026-02-15 10.0 10.0 100.0% 0 31.93s 101 7,704
Qwen3.5-27B 10.0 10.0 100.0% 0 19.66s 97 11,638
GPT-5.4 10.0 10.0 100.0% 0 3.11s 93 897
Utatuzi wa mafumbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5 Plus 2026-02-15 10.0 10.0 100.0% 0 34.57s 340 14,496
Qwen3.5-27B 8.2 7.7 77.8% 1 64.61s 245 77,213
GPT-5.4 8.2 7.2 88.9% 1 9.13s 442 3,832
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5 Plus 2026-02-15 10.0 10.0 100.0% 0 7.54s 309 909
Qwen3.5-27B 10.0 10.0 100.0% 0 7.45s 348 1,323
GPT-5.4 10.0 10.0 100.0% 0 13.28s 264 1,031
Maarifa ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Qwen3.5 Plus 2026-02-15 3.0 10.0 0.0% 0 103.81s 24 17,130
Qwen3.5-27B 3.0 10.0 0.0% 0 85.11s 31 23,683
GPT-5.4 3.0 10.0 0.0% 0 13.95s 30 1,821

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho