Urambazaji
AI BENCHY
Linganisha Chati
❤️ Made by XCS
Your ad here

AI BENCHY Compare

OpenAI: gpt-oss-120b vs Qwen: Qwen3.5 Plus 2026-02-15

Jina la modeli:

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe : 2026-02-27 15:16

Muhtasari

Kipimo OpenAI: gpt-oss-120b medium Toleo: Tarehe ya kutolewa haijulikani Inapatikana bure Qwen: Qwen3.5 Plus 2026-02-15 medium Toleo: Tarehe ya kutolewa haijulikani
Nafasi #25 #4
Alama 5.64 8.64
Uthabiti 7.55 10.00
Gharama kwa matokeo 0.101 1.955
Jumla ya gharama $0.008 $0.235
Majaribio sahihi
Majaribio yenye makosa 7 2
Kiwango cha kupita kwa kila jaribio 59.5% 85.7%
Majaribio yasiyo thabiti 4 0
Tokeni za matokeo 11,407 1,258
Tokeni za hoja 26,106 93,374

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
OpenAI: gpt-oss-120b 7.00 9.81 66.7% 0 3,463 2,077
Qwen: Qwen3.5 Plus 2026-02-15 10.00 10.00 100.0% 0 186 5,926
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
OpenAI: gpt-oss-120b 5.50 5.87 66.7% 1 241 1,114
Qwen: Qwen3.5 Plus 2026-02-15 10.00 10.00 100.0% 0 283 14,892
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
OpenAI: gpt-oss-120b 1.00 4.41 22.2% 2 6,018 18,520
Qwen: Qwen3.5 Plus 2026-02-15 4.00 10.00 33.3% 0 56 39,882
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
OpenAI: gpt-oss-120b 10.00 10.00 100.0% 0 120 1,770
Qwen: Qwen3.5 Plus 2026-02-15 9.50 9.99 100.0% 0 102 9,257
Puzzle Solving Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
OpenAI: gpt-oss-120b 5.00 7.13 44.4% 1 1,278 1,542
Qwen: Qwen3.5 Plus 2026-02-15 10.00 10.00 100.0% 0 322 22,508
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
OpenAI: gpt-oss-120b 9.00 9.97 100.0% 0 287 1,083
Qwen: Qwen3.5 Plus 2026-02-15 10.00 10.00 100.0% 0 309 909

Badilisha jozi ya ulinganisho