
AI BENCHY Compare

OpenAI: GPT-5 Mini vs Qwen: Qwen3.5-Flash


Benchmarks generated from the AI BENCHY test suites on 2026-03-03.

| Metric | OpenAI: GPT-5 Mini (medium, version 2025-08-07) | Qwen: Qwen3.5-Flash (medium, version 2026-02-24) |
| --- | --- | --- |
| Rank | #33 | #32 |
| Average score | 5.77 | 5.79 |
| Consistency | 8.79 | 7.60 |
| Cost per result | 1.200 | 0.650 |
| Total cost | $0.084 | $0.046 |
| Correct tests | | |
| Pass rate per test | 57.1% | 66.7% |
| Unstable tests | 2 | 4 |
| Output tokens | 4,723 | 1,194 |
| Reasoning tokens | 35,392 | 108,368 |

[Chart: Top models by score]

[Chart: Score vs. total cost]

Category breakdown

| Category | Model | Score | Consistency | Pass rate per test | Unstable tests | Correct tests | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI techniques | OpenAI: GPT-5 Mini | 7.00 | 9.62 | 66.7% | 0 | | 1,645 | 5,824 |
| Anti-AI techniques | Qwen: Qwen3.5-Flash | 10.00 | 10.00 | 100.0% | 0 | | 363 | 23,645 |
| Data parsing and extraction | OpenAI: GPT-5 Mini | 9.88 | 10.00 | 100.0% | 0 | | 453 | 3,200 |
| Data parsing and extraction | Qwen: Qwen3.5-Flash | 5.50 | 5.87 | 83.3% | 1 | | 235 | 16,237 |
| Domain-specific | OpenAI: GPT-5 Mini | 1.00 | 7.21 | 22.2% | 1 | | 293 | 14,016 |
| Domain-specific | Qwen: Qwen3.5-Flash | 1.00 | 4.41 | 33.3% | 2 | | 52 | 34,605 |
| Instruction following | OpenAI: GPT-5 Mini | 7.00 | 6.64 | 66.7% | 1 | | 318 | 4,992 |
| Instruction following | Qwen: Qwen3.5-Flash | 7.50 | 9.91 | 50.0% | 0 | | 98 | 14,139 |
| Puzzle Solving | OpenAI: GPT-5 Mini | 4.33 | 9.78 | 33.3% | 0 | | 1,527 | 5,760 |
| Puzzle Solving | Qwen: Qwen3.5-Flash | 4.00 | 7.21 | 55.6% | 1 | | 137 | 18,458 |
| Tool calling | OpenAI: GPT-5 Mini | 10.00 | 10.00 | 100.0% | 0 | | 487 | 1,600 |
| Tool calling | Qwen: Qwen3.5-Flash | 10.00 | 10.00 | 100.0% | 0 | | 309 | 1,284 |
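
The category figures above can be cross-checked against the overall totals reported earlier. The page does not say how the totals are derived, so the sketch below only assumes they are plain sums over the six categories and tests that assumption. The names `CATEGORIES`, `REPORTED`, `column_sums`, and `check_totals` are illustrative and not part of AI BENCHY.

```python
# Per-category (output tokens, reasoning tokens, unstable tests), copied from
# the category figures on this page, in the order the categories are listed.
CATEGORIES = {
    "OpenAI: GPT-5 Mini": [
        (1645, 5824, 0),    # Anti-AI techniques
        (453, 3200, 0),     # Data parsing and extraction
        (293, 14016, 1),    # Domain-specific
        (318, 4992, 1),     # Instruction following
        (1527, 5760, 0),    # Puzzle Solving
        (487, 1600, 0),     # Tool calling
    ],
    "Qwen: Qwen3.5-Flash": [
        (363, 23645, 0),
        (235, 16237, 1),
        (52, 34605, 2),
        (98, 14139, 0),
        (137, 18458, 1),
        (309, 1284, 0),
    ],
}

# Overall (output tokens, reasoning tokens, unstable tests) from the summary.
REPORTED = {
    "OpenAI: GPT-5 Mini": (4723, 35392, 2),
    "Qwen: Qwen3.5-Flash": (1194, 108368, 4),
}


def column_sums(rows):
    """Sum each column across the per-category rows."""
    return tuple(sum(column) for column in zip(*rows))


def check_totals():
    """Return, per model, whether the category sums equal the reported totals.

    Assumption: totals are simple sums over categories (not stated by the page).
    """
    return {
        model: column_sums(rows) == REPORTED[model]
        for model, rows in CATEGORIES.items()
    }


if __name__ == "__main__":
    for model, ok in check_totals().items():
        print(f"{model}: {'consistent' if ok else 'mismatch'}")
```

For both models the sums agree with the reported output tokens, reasoning tokens, and unstable test counts, which supports reading the summary figures as per-category sums.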
