AI BENCHY
Linganisha
❤️ Made by XCS

Jina la modeli

Google: Gemini 3 Flash Preview

Benchmark zimetengenezwa kutoka seti za majaribio za Aibenchy tarehe : 19 Feb 2026

Kipimo Google: Gemini 3 Flash Preview
Nafasi#10
KampuniGoogle
Score 6.25
Uthabiti 8.60
Gharama kwa matokeo 0.0754
Jumla ya gharama $0.00528
Majaribio sahihi 7/12
Kiwango cha kupita kwa kila jaribio 66.7%
Majaribio yasiyo thabiti 2
Tokeni za matokeo 485
Tokeni za hoja 0

Mgawanyo wa kategoria

Kategoria Majaribio yaliyopita kikamilifu Score Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Alama ya hoja Gharama
Anti-AI Tricks 1/2 5.50 10.00 50.0% 0 - $0.00016
Data parsing and extraction 1/2 5.50 5.81 83.3% 1 - $0.00357
Domain specific 2/3 7.00 10.00 66.7% 0 - $0.00038
Instructions following 1/2 5.50 5.81 66.7% 1 - $0.00054
Puzzle Solving 2/3 7.00 10.00 66.7% 0 - $0.00066

Modeli zilizolinganishwa

Linganisha Google: Gemini 3 Flash Preview dhidi ya...

#9 · MoonshotAI

MoonshotAI: Kimi K2.5

Uchambuzi (medium)

Score: 6.42

Uthabiti: 8.00

Kiwango cha kupita kwa kila jaribio: 72.2%

Majaribio yasiyo thabiti: 3

Gharama kwa matokeo: 2.4097

Majaribio sahihi: 7/12

Jumla ya gharama: $0.16868

Linganisha

#11 · OpenAI

OpenAI: GPT-5 Nano

Uchambuzi (medium)

Score: 5.92

Uthabiti: 6.03

Kiwango cha kupita kwa kila jaribio: 72.2%

Majaribio yasiyo thabiti: 6

Gharama kwa matokeo: 0.4675

Majaribio sahihi: 6/12

Jumla ya gharama: $0.02806

Linganisha

#8 · X Ai

xAI: Grok 4.1 Fast

Uchambuzi (medium)

Score: 6.42

Uthabiti: 8.60

Kiwango cha kupita kwa kila jaribio: 66.7%

Majaribio yasiyo thabiti: 2

Gharama kwa matokeo: 0.4800

Majaribio sahihi: 7/12

Jumla ya gharama: $0.03360

Linganisha

Ulinganisho wa haraka

Linganisha Google: Gemini 3 Flash Preview dhidi ya...