AI BENCHY
Advertise here

Kategoria ya AI BENCHY

Orodha ya Utatuzi wa mafumbo

Ona ni modeli gani za AI zinafanya vizuri zaidi katika Utatuzi wa mafumbo, zipi zinabaki thabiti, na pengo kubwa liko wapi. Panga kwa: Majaribio sahihi ↑.

Modeli zilizoonyeshwa

15

Wastani wa Alama ya Utatuzi wa mafumbo

6.7

Modeli bora

GPT-5.4 Nano 4.1
Nafasi Modeli Kampuni Alama ya Utatuzi wa mafumbo Alama Majaribio sahihi Muda wa majibu (wastani)
#126 gpt-oss-120b none OpenAI 6.0 5.4 1/3 8.21s
#127 Grok 4.20 none X AI 5.3 5.4 1/3 473ms
#129 MiniMax M2.5 medium Minimax 5.3 5.3 1/3 11.2s
#130 MiniMax M2.7 medium Minimax 5.9 5.3 1/3 24.9s
#134 GLM 5 Turbo none Z.ai 5.5 5.2 1/3 2.65s
#136 Elephant Alpha medium Openrouter 5.3 5.1 1/3 868ms
#141 Nemotron 3 Super none NVIDIA 5.5 4.9 1/3 2.36s
#143 MiMo-V2.5 none Xiaomi 5.4 4.9 1/3 2.13s
#144 GPT-5.4 Mini none OpenAI 5.4 4.9 1/3 836ms
#146 Laguna Xs.2 none Poolside 5.3 4.8 1/3 650ms
#148 GPT-5.4 Nano none OpenAI 5.4 4.7 1/3 1.25s
#152 MiMo-V2-Flash none Xiaomi 5.3 4.6 1/3 1.86s
#7 Gemini 3.5 Flash medium Google 7.7 9.0 2/3 2.38s
#12 Gemini 3.1 Flash Lite Preview high Google 7.7 8.6 2/3 46.7s
#15 GPT-5.3-Codex medium OpenAI 9.0 8.4 2/3 5.05s

Modeli bora kwa Alama ya Utatuzi wa mafumbo

Alama ya Utatuzi wa mafumbo dhidi ya jumla ya gharama

Modeli bora kwa Muda wa majibu (wastani)