AI BENCHY
❤️ Made by XCS

Model name: Z.ai: GLM 4.7 Flash

Benchmarks generated from the Aibenchy test sets on 19 Feb 2026

Metrics for Z.ai: GLM 4.7 Flash
Rank: #20
Company: Z.ai
Score: 3.92
Consistency: 6.51
Cost per result: 0.2253
Total cost: $0.00902
Correct tests: 4/12
Pass rate per attempt: 50.0%
Flaky tests: 5
Output tokens: 7,601
Reasoning tokens: 18,390

Category breakdown

| Category | Fully passed tests | Score | Consistency | Pass rate per attempt | Flaky tests | Reasoning score | Cost |
|---|---|---|---|---|---|---|---|
| Anti-AI Tricks | 1/2 | 5.50 | 5.81 | 66.7% | 1 | 9.08 | $0.00131 |
| Data parsing and extraction | 2/2 | 10.00 | 10.00 | 100.0% | 0 | 9.87 | $0.00281 |
| Domain specific | 0/3 | 1.00 | 4.41 | 33.3% | 2 | 8.21 | $0.00183 |
| Instructions following | 1/2 | 5.00 | 5.81 | 66.7% | 1 | 9.50 | $0.00105 |
| Puzzle Solving | 0/3 | 1.00 | 7.20 | 11.1% | 1 | 7.33 | $0.00203 |

Compared models

#19 · OpenAI
OpenAI: GPT-4o-mini
No reasoning
Score: 4.00
Consistency: 9.98
Pass rate per attempt: 25.0%
Flaky tests: 0
Cost per result: 0.0576
Correct tests: 3/12
Total cost: $0.00173

#21 · Xiaomi
Xiaomi: MiMo-V2-Flash
Reasoning (medium)
Score: 3.92
Consistency: 7.89
Pass rate per attempt: 44.4%
Flaky tests: 3
Cost per result: 0.4829
Correct tests: 4/12
Total cost: $0.01932

#18 · Stepfun
StepFun: Step 3.5 Flash
Reasoning (medium)
Score: 4.92
Consistency: 7.34
Pass rate per attempt: 58.3%
Flaky tests: 4
Cost per result: 0.0000
Correct tests: 5/12
Total cost: $0.00000