AI BENCHY
Linganisha
❤️ Made by XCS

Jina la modeli

Google: Gemini 3 Flash Preview

Benchmark zimetengenezwa kutoka seti za majaribio za Aibenchy tarehe : 19 Feb 2026

Kipimo Google: Gemini 3 Flash Preview
Nafasi#1
KampuniGoogle
Score 9.92
Uthabiti 10.00
Gharama kwa matokeo 0.8502
Jumla ya gharama $0.10203
Majaribio sahihi 12/12
Kiwango cha kupita kwa kila jaribio 100.0%
Majaribio yasiyo thabiti 0
Tokeni za matokeo 590
Tokeni za hoja 31,913

Mgawanyo wa kategoria

Kategoria Majaribio yaliyopita kikamilifu Score Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Alama ya hoja Gharama
Anti-AI Tricks 2/2 10.00 10.00 100.0% 0 7.17 $0.00544
Data parsing and extraction 2/2 10.00 10.00 100.0% 0 9.17 $0.02077
Domain specific 3/3 10.00 10.00 100.0% 0 5.56 $0.04625
Instructions following 2/2 10.00 10.00 100.0% 0 5.50 $0.01281
Puzzle Solving 3/3 9.67 10.00 100.0% 0 6.50 $0.01679

Modeli zilizolinganishwa

Linganisha Google: Gemini 3 Flash Preview dhidi ya...

#2 · Google

Google: Gemini 3.1 Pro Preview

Uchambuzi (medium)

Score: 9.25

Uthabiti: 10.00

Kiwango cha kupita kwa kila jaribio: 91.7%

Majaribio yasiyo thabiti: 0

Gharama kwa matokeo: 2.5543

Majaribio sahihi: 11/12

Jumla ya gharama: $0.28097

Linganisha

#3 · Google

Google: Gemini 3 Pro Preview

Uchambuzi (medium)

Score: 8.42

Uthabiti: 10.00

Kiwango cha kupita kwa kila jaribio: 83.3%

Majaribio yasiyo thabiti: 0

Gharama kwa matokeo: 0.8028

Majaribio sahihi: 10/12

Jumla ya gharama: $0.08029

Linganisha

#4 · Qwen

Qwen: Qwen3.5 Plus 2026-02-15

Uchambuzi (medium)

Score: 8.42

Uthabiti: 9.30

Kiwango cha kupita kwa kila jaribio: 86.1%

Majaribio yasiyo thabiti: 1

Gharama kwa matokeo: 2.3151

Majaribio sahihi: 10/12

Jumla ya gharama: $0.23151

Linganisha

Ulinganisho wa haraka

Linganisha Google: Gemini 3 Flash Preview dhidi ya...