AI BENCHY
Linganisha
❤️ Made by XCS

Jina la modeli

Google: Gemini 3.1 Pro Preview

Benchmark zimetengenezwa kutoka seti za majaribio za Aibenchy tarehe : 19 Feb 2026

Kipimo Google: Gemini 3.1 Pro Preview
Nafasi#2
KampuniGoogle
Score 9.25
Uthabiti 10.00
Gharama kwa matokeo 2.5543
Jumla ya gharama $0.28097
Majaribio sahihi 11/12
Kiwango cha kupita kwa kila jaribio 91.7%
Majaribio yasiyo thabiti 0
Tokeni za matokeo 632
Tokeni za hoja 21,277

Mgawanyo wa kategoria

Kategoria Majaribio yaliyopita kikamilifu Score Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Alama ya hoja Gharama
Anti-AI Tricks 2/2 10.00 10.00 100.0% 0 5.75 $0.02289
Data parsing and extraction 2/2 10.00 10.00 100.0% 0 9.50 $0.05541
Domain specific 2/3 7.00 10.00 66.7% 0 5.63 $0.12975
Instructions following 2/2 10.00 10.00 100.0% 0 5.67 $0.03134
Puzzle Solving 3/3 10.00 10.00 100.0% 0 8.89 $0.04159

Modeli zilizolinganishwa

Linganisha Google: Gemini 3.1 Pro Preview dhidi ya...

#1 · Google

Google: Gemini 3 Flash Preview

Uchambuzi (medium)

Score: 9.92

Uthabiti: 10.00

Kiwango cha kupita kwa kila jaribio: 100.0%

Majaribio yasiyo thabiti: 0

Gharama kwa matokeo: 0.8502

Majaribio sahihi: 12/12

Jumla ya gharama: $0.10203

Linganisha

#3 · Google

Google: Gemini 3 Pro Preview

Uchambuzi (medium)

Score: 8.42

Uthabiti: 10.00

Kiwango cha kupita kwa kila jaribio: 83.3%

Majaribio yasiyo thabiti: 0

Gharama kwa matokeo: 0.8028

Majaribio sahihi: 10/12

Jumla ya gharama: $0.08029

Linganisha

#4 · Qwen

Qwen: Qwen3.5 Plus 2026-02-15

Uchambuzi (medium)

Score: 8.42

Uthabiti: 9.30

Kiwango cha kupita kwa kila jaribio: 86.1%

Majaribio yasiyo thabiti: 1

Gharama kwa matokeo: 2.3151

Majaribio sahihi: 10/12

Jumla ya gharama: $0.23151

Linganisha

Ulinganisho wa haraka

Linganisha Google: Gemini 3.1 Pro Preview dhidi ya...