AI BENCHY Category
Tool Calling Leaderboard
See which AI models perform best at tool calling, which stay consistent, and where the biggest gap lies. Sorted by: Response time (average) ↓.
| Rank | Model | Company | Tool calling score | Score | Correct tests | Response time (average) |
|---|---|---|---|---|---|---|
| #121 | Owl Alpha none | OpenRouter | 10.0 | 5.5 | 1/1 | 22.8s |
| #103 | DeepSeek V4 Pro high | DeepSeek | 10.0 | 6.0 | 1/1 | 21.3s |
| #138 | Ling-2.6-flash none | Inclusionai | 3.0 | 5.0 | 0/1 | 18.8s |
| #54 | GPT-5 Mini medium | OpenAI | 10.0 | 7.3 | 1/1 | 18.6s |
| #14 | Qwen3.6 Max Preview medium | Qwen | 10.0 | 8.5 | 1/1 | 18.3s |
| #89 | Hy3 preview low | Tencent | 2.8 | 6.4 | 0/1 | 17.8s |
| #38 | Grok 4.3 medium | xAI | 10.0 | 7.6 | 1/1 | 17.7s |
| #79 | Hunter Alpha medium | OpenRouter | 10.0 | 6.7 | 1/1 | 17.3s |
| #78 | Qwen3.6 27B medium | Qwen | 10.0 | 6.8 | 1/1 | 16.9s |
| #43 | MiMo-V2.5-Pro medium | Xiaomi | 10.0 | 7.5 | 1/1 | 16.9s |
| #141 | Nemotron 3 Super none | NVIDIA | 4.7 | 4.9 | 0/1 | 16.0s |
| #158 | GLM 4.7 Flash medium | Z.ai | 10.0 | 4.4 | 1/1 | 15.9s |
| #17 | GLM 5 medium | Z.ai | 10.0 | 8.3 | 1/1 | 15.9s |
| #129 | MiniMax M2.5 medium | MiniMax | 10.0 | 5.3 | 1/1 | 15.4s |
| #33 | Hy3 preview medium | Tencent | 10.0 | 7.7 | 1/1 | 15.0s |