AI BENCHY

AI BENCHY Category

Puzzle Solving Leaderboard

See which AI models perform best at Puzzle Solving, which ones stay consistent, and where the biggest gap lies.

Models shown

15

Average Puzzle Solving Score

6.4

Rank  Model                       Company         Puzzle Solving Score  Score  Correct tests  Response time (average)
#70   Qwen3.5-122B-A10B none      Qwen            5.4                   5.7    1/3            982ms
#86   GPT-5.4 Mini none           OpenAI          5.4                   5.1    1/3            860ms
#78   Trinity Large Preview none  Arcee AI        5.4                   5.3    1/3            3.30s
#46   Kimi K2.5 medium            Moonshot AI     5.3                   7.0    1/3            45.4s
#30   Step 3.5 Flash medium       Stepfun         5.3                   7.9    1/3            7.72s
#58   GLM 5V Turbo none           Z.ai            5.3                   6.2    1/3            2.22s
#52   Grok 4.1 Fast medium        X AI            5.3                   6.7    1/3            8.08s
#57   GPT-5 Nano medium           OpenAI          5.3                   6.3    1/3            19.8s
#71   MiniMax M2.5 medium         Minimax         5.3                   5.7    1/3            11.5s
#82   Grok 4.20 none              X AI            5.3                   5.2    1/3            487ms
#61   Seed-2.0-Lite none          Bytedance Seed  5.2                   6.2    1/3            2.46s
#34   Kimi K2.6 medium            Moonshot AI     5.0                   7.7    0/3            25.6s
#84   gpt-oss-120b none           OpenAI          4.5                   5.2    0/3            6.86s
#98   LFM2-24B-A2B none           Liquid          4.4                   4.1    0/3            1.69s
#74   GLM 4.7 Flash none          Z.ai            4.4                   5.6    0/3            1.00s

Top models by Puzzle Solving Score

Puzzle Solving Score vs. total cost

Top models by Response time (average)