AI BENCHY category failures
Coding: API Error
See which AI models are most likely to hit an API Error in Coding, so you can spot weaknesses quickly. Sorted by: Correct tests ↓.
Failure reasons
| Rank | Model | Company | API Error count | Category score | Correct tests | Response time (avg) |
|---|---|---|---|---|---|---|
| #10 | Gemini 3 PRO Preview medium | | 1 | 3.0 | 0/1 | 0ms |
| #18 | Qwen3.6 Plus medium | Qwen | 1 | 3.0 | 0/1 | 0ms |
| #47 | Hunter Alpha medium | OpenRouter | 1 | 3.0 | 0/1 | 0ms |
| #48 | Nemotron 3 Super medium | NVIDIA | 1 | 3.0 | 0/1 | 0ms |
| #68 | Hunter Alpha none | OpenRouter | 1 | 3.0 | 0/1 | 0ms |
| #93 | Step 3.5 Flash none | Stepfun | 1 | 3.0 | 0/1 | 0ms |