AI BENCHY
Your ad here

Kategoria ya AI BENCHY

Orodha ya Utatuzi wa mafumbo

Ona ni modeli gani za AI zinafanya vizuri zaidi katika Utatuzi wa mafumbo, zipi zinabaki thabiti, na pengo kubwa liko wapi. Panga kwa: Muda wa majibu (wastani) ↓.

Modeli zilizoonyeshwa

15

Wastani wa Alama ya Utatuzi wa mafumbo

6.4

Modeli bora

Qwen3.5-27B 8.2
Nafasi Modeli Kampuni Alama ya Utatuzi wa mafumbo Alama Majaribio sahihi Muda wa majibu (wastani)
#1 Gemini 3 Flash Preview medium Google 10.0 10.0 3/3 4.43s
#28 GPT-5.2 Chat none OpenAI 7.7 7.9 2/3 4.42s
#44 GPT-5.4 Mini medium OpenAI 6.8 7.3 1/3 4.33s
#15 Gemini 2.5 Flash medium Google 7.7 8.2 2/3 3.94s
#12 Gemini 3 PRO Preview medium Google 10.0 8.4 3/3 3.91s
#47 Grok 4.20 medium X AI 6.4 7.0 1/3 3.89s
#35 MiMo-V2-Omni medium Xiaomi 6.5 7.7 1/3 3.88s
#25 Grok 4.20 Beta medium X AI 8.2 8.0 2/3 3.85s
#41 MiMo-V2-Flash medium Xiaomi 7.7 7.5 2/3 3.77s
#38 GPT-5.4 Nano medium OpenAI 4.0 7.6 0/3 3.65s
#17 Gemini 3.1 Flash Lite Preview medium Google 7.7 8.2 2/3 3.58s
#78 Trinity Large Preview none Arcee AI 5.4 5.3 1/3 3.30s
#72 Hunter Alpha none OpenRouter 5.8 5.7 1/3 3.06s
#48 Gemma 4 31B none Google 5.5 6.9 1/3 2.95s
#36 GPT-5.3 Chat none OpenAI 10.0 7.7 3/3 2.93s

Modeli bora kwa Alama ya Utatuzi wa mafumbo

Alama ya Utatuzi wa mafumbo dhidi ya jumla ya gharama

Modeli bora kwa Muda wa majibu (wastani)