AI BENCHY
Linganisha
❤️ Made by XCS

Jina la modeli

Qwen: Qwen3.5 Plus 2026-02-15

Benchmark zimetengenezwa kutoka seti za majaribio za Aibenchy tarehe : 19 Feb 2026

Kipimo Qwen: Qwen3.5 Plus 2026-02-15
Nafasi#4
KampuniQwen
Score 8.42
Uthabiti 9.30
Gharama kwa matokeo 2.3151
Jumla ya gharama $0.23151
Majaribio sahihi 10/12
Kiwango cha kupita kwa kila jaribio 86.1%
Majaribio yasiyo thabiti 1
Tokeni za matokeo 802
Tokeni za hoja 93,972

Mgawanyo wa kategoria

Kategoria Majaribio yaliyopita kikamilifu Score Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Alama ya hoja Gharama
Anti-AI Tricks 2/2 10.00 10.00 100.0% 0 9.17 $0.00855
Data parsing and extraction 2/2 10.00 10.00 100.0% 0 9.61 $0.03952
Domain specific 1/3 4.00 7.21 44.4% 1 7.28 $0.10564
Instructions following 2/2 9.50 9.99 100.0% 0 9.33 $0.02275
Puzzle Solving 3/3 10.00 10.00 100.0% 0 8.28 $0.05508

Modeli zilizolinganishwa

Linganisha Qwen: Qwen3.5 Plus 2026-02-15 dhidi ya...

#3 · Google

Google: Gemini 3 Pro Preview

Uchambuzi (medium)

Score: 8.42

Uthabiti: 10.00

Kiwango cha kupita kwa kila jaribio: 83.3%

Majaribio yasiyo thabiti: 0

Gharama kwa matokeo: 0.8028

Majaribio sahihi: 10/12

Jumla ya gharama: $0.08029

Linganisha

#5 · OpenAI

OpenAI: GPT-5.2

Uchambuzi (medium)

Score: 7.92

Uthabiti: 9.30

Kiwango cha kupita kwa kila jaribio: 80.6%

Majaribio yasiyo thabiti: 1

Gharama kwa matokeo: 2.2838

Majaribio sahihi: 9/12

Jumla ya gharama: $0.20554

Linganisha

#2 · Google

Google: Gemini 3.1 Pro Preview

Uchambuzi (medium)

Score: 9.25

Uthabiti: 10.00

Kiwango cha kupita kwa kila jaribio: 91.7%

Majaribio yasiyo thabiti: 0

Gharama kwa matokeo: 2.5543

Majaribio sahihi: 11/12

Jumla ya gharama: $0.28097

Linganisha

Ulinganisho wa haraka

Linganisha Qwen: Qwen3.5 Plus 2026-02-15 dhidi ya...