AI BENCHY
Linganisha
❤️ Made by XCS

Jina la modeli

OpenAI: GPT-4o-mini

Benchmark zimetengenezwa kutoka seti za majaribio za Aibenchy tarehe : 19 Feb 2026

Kipimo OpenAI: GPT-4o-mini
Nafasi#19
KampuniOpenAI
Score 4.00
Uthabiti 9.98
Gharama kwa matokeo 0.0576
Jumla ya gharama $0.00173
Majaribio sahihi 3/12
Kiwango cha kupita kwa kila jaribio 25.0%
Majaribio yasiyo thabiti 0
Tokeni za matokeo 570
Tokeni za hoja 0

Mgawanyo wa kategoria

Kategoria Majaribio yaliyopita kikamilifu Score Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Alama ya hoja Gharama
Anti-AI Tricks 0/2 1.00 10.00 0.0% 0 - $0.00005
Data parsing and extraction 2/2 10.00 10.00 100.0% 0 - $0.00115
Domain specific 0/3 1.00 10.00 0.0% 0 - $0.00012
Instructions following 1/2 5.50 10.00 50.0% 0 - $0.00015
Puzzle Solving 0/3 4.00 9.92 0.0% 0 - $0.00028

Modeli zilizolinganishwa

Linganisha OpenAI: GPT-4o-mini dhidi ya...

#18 · Stepfun

StepFun: Step 3.5 Flash

Uchambuzi (medium)

Score: 4.92

Uthabiti: 7.34

Kiwango cha kupita kwa kila jaribio: 58.3%

Majaribio yasiyo thabiti: 4

Gharama kwa matokeo: 0.0000

Majaribio sahihi: 5/12

Jumla ya gharama: $0.00000

Linganisha

#20 · Z.ai

Z.ai: GLM 4.7 Flash

Uchambuzi (medium)

Score: 3.92

Uthabiti: 6.51

Kiwango cha kupita kwa kila jaribio: 50.0%

Majaribio yasiyo thabiti: 5

Gharama kwa matokeo: 0.2253

Majaribio sahihi: 4/12

Jumla ya gharama: $0.00902

Linganisha

#17 · MiniMax

MiniMax: MiniMax M2.5

Uchambuzi (medium)

Score: 5.08

Uthabiti: 6.00

Kiwango cha kupita kwa kila jaribio: 61.1%

Majaribio yasiyo thabiti: 6

Gharama kwa matokeo: 4.0276

Majaribio sahihi: 5/12

Jumla ya gharama: $0.20138

Linganisha

Ulinganisho wa haraka

Linganisha OpenAI: GPT-4o-mini dhidi ya...