AI BENCHY

AI BENCHY Failures

API Error Failures

See which AI models hit API errors most often, so you can spot reliability risks before choosing one. Sorted by: Correct tests ↓.

Models shown: 15

Total failures: 144

Most affected model: Gemini 3.5 Flash 3
| Rank | Model | Company | API Error count | Score | Correct tests | Response time (avg) |
|------|-------|---------|-----------------|-------|---------------|---------------------|
| #75 | Ring-2.6-1T medium | Inclusionai | 2 | 6.9 | 11/21 | 61.3s |
| #82 | Hy3 preview high | Tencent | 7 | 6.6 | 11/21 | 56.6s |
| #83 | Step 3.5 Flash none | Stepfun | 4 | 6.6 | 6/12 | 39.0s |
| #80 | Mimo V2 Omni medium | Xiaomi | 1 | 6.7 | 10/21 | 41.2s |
| #85 | Gemma 4 31B none | Google | 2 | 6.5 | 10/21 | 4.05s |
| #89 | Hy3 preview low | Tencent | 7 | 6.4 | 10/21 | 24.6s |
| #92 | Laguna M.1 medium | Poolside | 4 | 6.4 | 9/19 | 14.7s |
| #93 | Qwen3.6 Plus Preview medium | Qwen | 8 | 6.3 | 9/19 | 15.2s |
| #79 | Hunter Alpha medium | OpenRouter | 1 | 6.7 | 8/18 | 10.3s |
| #84 | Grok 4.20 Multi Agent Beta medium | X AI | 2 | 6.6 | 8/18 | 9.69s |
| #96 | Ring-2.6-1T none | Inclusionai | 5 | 6.2 | 9/21 | 55.1s |
| #101 | Mimo V2 Omni none | Xiaomi | 1 | 6.0 | 8/21 | 2.44s |
| #103 | DeepSeek V4 Pro high | DeepSeek | 5 | 6.0 | 8/21 | 65.2s |
| #105 | Nemotron 3 Super medium | NVIDIA | 3 | 5.8 | 8/21 | 32.0s |
| #111 | Owl Alpha medium | Openrouter | 1 | 5.7 | 8/21 | 11.9s |
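
The row order above looks consistent with sorting by the fraction of correct tests rather than the raw count (for example, 6/12 sits between 11/21 and 10/21), with ties falling in rank order. The site does not document its sort, so the following is only a minimal sketch under that assumption; the `rows` list and `sort_key` helper are hypothetical names, and the data is a handful of rows copied from the table.

```python
from fractions import Fraction

# A few rows copied from the table: (rank, model, correct, attempted).
rows = [
    (101, "Mimo V2 Omni none", 8, 21),
    (83, "Step 3.5 Flash none", 6, 12),
    (75, "Ring-2.6-1T medium", 11, 21),
    (92, "Laguna M.1 medium", 9, 19),
    (96, "Ring-2.6-1T none", 9, 21),
    (82, "Hy3 preview high", 11, 21),
]

def sort_key(row):
    """Assumed ordering: highest pass fraction first, ties broken by rank."""
    rank, _model, correct, attempted = row
    # Fraction avoids floating-point rounding when comparing ratios.
    return (-Fraction(correct, attempted), rank)

ordered = sorted(rows, key=sort_key)
for rank, model, correct, attempted in ordered:
    print(f"#{rank} {model}: {correct}/{attempted} = {correct / attempted:.3f}")
```

Under this assumption the sketch reproduces the relative order in which these six rows appear in the table.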

Top models by API Error count

API Error count vs. Score

Top models by Response time (avg)