AI BENCHY
Your ad here

Kategoria ya AI BENCHY

Orodha ya Utatuzi wa mafumbo

Ona ni modeli gani za AI zinafanya vizuri zaidi katika Utatuzi wa mafumbo, zipi zinabaki thabiti, na pengo kubwa liko wapi.

Modeli zilizoonyeshwa

15

Wastani wa Alama ya Utatuzi wa mafumbo

6.4

Nafasi Modeli Kampuni Alama ya Utatuzi wa mafumbo Alama Majaribio sahihi Muda wa majibu (wastani)
#38 GPT-5.4 Nano medium OpenAI 4.0 7.6 0/3 3.65s
#54 Mercury 2 medium Inception 3.9 6.5 0/3 934ms
#63 Qwen3.5-35B-A3B none Qwen 3.9 6.1 0/3 1.34s
#80 MiniMax M2.7 medium Minimax 3.8 5.3 0/3 25.6s
#96 GPT-5.4 Nano none OpenAI 3.7 4.5 0/3 1.29s
#81 Elephant medium Openrouter 3.7 5.2 0/3 867ms
#89 GPT-4o-mini none OpenAI 3.7 4.9 0/3 1.30s
#94 MiMo-V2-Flash none Xiaomi 3.6 4.5 0/3 1.38s
#51 Nemotron 3 Super medium NVIDIA 3.5 6.7 0/3 8.39s
#69 Kimi K2.6 none Moonshot AI 3.4 5.8 0/3 1.66s
#73 Mistral Small 4 medium Mistral 3.4 5.7 0/3 2.00s
#59 Qwen3.5-Flash none Qwen 3.3 6.2 0/3 5.90s
#85 Elephant none Openrouter 3.3 5.2 0/3 849ms
#90 Qwen3.5-9B none Qwen 3.2 4.8 0/3 683ms
#95 Grok 4.1 Fast none X AI 3.2 4.5 0/3 1.28s

Modeli bora kwa Alama ya Utatuzi wa mafumbo

Alama ya Utatuzi wa mafumbo dhidi ya jumla ya gharama

Modeli bora kwa Muda wa majibu (wastani)