Kushindwa kwa AI BENCHY
Kushindwa kwa Muda umeisha
Ona ni modeli gani za AI hukutana na Muda umeisha mara nyingi zaidi ili utambue hatari za utegemevu kabla ya kuchagua. Panga kwa: Muda wa majibu (wastani) ↓.
| Nafasi | Modeli | Kampuni | Idadi ya Muda umeisha | Alama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|
| #79 | Hunter Alpha medium | OpenRouter | 2 | 6.7 | 8/18 | 10.3s |
| #150 | Qwen3 Coder Next medium | Qwen | 1 | 4.6 | 4/21 | 8.58s |
| #102 | Gemma 4 26B A4B none | 1 | 6.0 | 8/21 | 5.91s | |
| #11 | Claude Opus 4.7 medium | Anthropic | 1 | 8.7 | 17/21 | 4.73s |