AI BENCHY

AI BENCHY category failures

Coding: No response


See which AI models are most likely to return No response in Coding, so you can spot weaknesses quickly.

Models shown: 15

Total failures: 18

Most affected model: Gemini 3 PRO Preview (1)
| Rank | Model | Company | No response count | Category score | Correct attempts | Response time (avg) |
| #19 | Gemini 3 PRO Preview medium | Google | 1 | 3.0 | 0/2 | 0ms |
| #23 | Gemma 4 31B medium | Google | 1 | 3.8 | 0/2 | 110.9s |
| #28 | GLM 5 Turbo medium | Z.ai | 1 | 7.3 | 1/2 | 53.9s |
| #30 | Qwen3.6 35B A3B medium | Qwen | 1 | 6.6 | 1/2 | 59.3s |
| #47 | Gemma 4 26B A4B medium | Google | 1 | 2.9 | 0/2 | 258.4s |
| #51 | GLM 5.1 medium | Z.ai | 1 | 4.7 | 0/2 | 145.6s |
| #54 | Kimi K2.6 medium | Moonshot AI | 1 | 6.5 | 1/2 | 118.2s |
| #58 | Step 3.5 Flash medium | Stepfun | 1 | 3.0 | 0/1 | 62.8s |
| #70 | Qwen3.5-35B-A3B medium | Qwen | 1 | 6.5 | 1/2 | 244.5s |
| #72 | MiMo-V2-Omni medium | Xiaomi | 1 | 3.4 | 0/2 | 183.9s |
| #79 | Kimi K2.5 medium | Moonshot AI | 1 | 4.1 | 0/2 | 215.9s |
| #80 | DeepSeek V4 Pro high | DeepSeek | 1 | 2.8 | 0/2 | 51.8s |
| #83 | Qwen3.6 27B medium | Qwen | 1 | 6.6 | 1/2 | 165.4s |
| #122 | Elephant Alpha medium | Openrouter | 1 | 4.0 | 0/2 | 1.30s |
| #130 | Elephant Alpha none | Openrouter | 1 | 4.7 | 0/2 | 1.39s |

Top models by No response count

No response count vs. Score

Top models by Response time (avg)

Top models by Estimated wasted cost