AI BENCHY
Your ad here

Kategoria ya AI BENCHY

Orodha ya Utatuzi wa mafumbo

Ona ni modeli gani za AI zinafanya vizuri zaidi katika Utatuzi wa mafumbo, zipi zinabaki thabiti, na pengo kubwa liko wapi.

Modeli zilizoonyeshwa

15

Wastani wa Alama ya Utatuzi wa mafumbo

6.4

Nafasi Modeli Kampuni Alama ya Utatuzi wa mafumbo Alama Majaribio sahihi Muda wa majibu (wastani)
#32 Qwen3.5-Flash medium Qwen 6.4 7.8 1/3 56.7s
#43 Qwen3.5-35B-A3B medium Qwen 6.4 7.4 1/3 31.6s
#47 Grok 4.20 medium X AI 6.4 7.0 1/3 3.89s
#50 Hunter Alpha medium OpenRouter 6.1 6.7 1/3 5.36s
#65 MiMo-V2-Pro none Xiaomi 6.0 6.0 1/3 1.83s
#79 Grok 4.20 Beta none X AI 5.9 5.3 1/3 541ms
#72 Hunter Alpha none OpenRouter 5.8 5.7 1/3 3.06s
#60 Gemma 4 26B A4B none Google 5.7 6.2 1/3 739ms
#62 Gemini 2.5 Flash none Google 5.7 6.2 1/3 576ms
#75 GLM 5.1 none Z.ai 5.7 5.6 1/3 1.48s
#88 Nemotron 3 Super none NVIDIA 5.7 5.1 1/3 7.50s
#45 GPT-5 Mini medium OpenAI 5.6 7.0 1/3 14.1s
#66 GPT-5.4 none OpenAI 5.6 5.9 1/3 1.52s
#48 Gemma 4 31B none Google 5.5 6.9 1/3 2.95s
#77 GLM 5 Turbo none Z.ai 5.5 5.5 1/3 2.43s

Modeli bora kwa Alama ya Utatuzi wa mafumbo

Alama ya Utatuzi wa mafumbo dhidi ya jumla ya gharama

Modeli bora kwa Muda wa majibu (wastani)