Kategoria ya AI BENCHY
Orodha ya Uandishi wa msimbo
Ona ni modeli gani za AI zinafanya vizuri zaidi katika Uandishi wa msimbo, zipi zinabaki thabiti, na pengo kubwa liko wapi. Panga kwa: Muda wa majibu (wastani) ↓.
| Nafasi | Modeli | Kampuni | Alama ya Uandishi wa msimbo | Alama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|
| #68 | GPT-5 Mini medium | OpenAI | 10.0 | 7.2 | 2/2 | 30.7s |
| #66 | Claude Opus 4.6 medium | Anthropic | 7.2 | 7.2 | 1/2 | 29.4s |
| #39 | Hy3 preview low | Tencent | 10.0 | 7.7 | 1/1 | 27.9s |
| #84 | Grok 4.20 Multi Agent Beta medium | X AI | 10.0 | 6.6 | 1/1 | 27.1s |
| #2 | Gemini 3.5 Flash high | 10.0 | 9.6 | 2/2 | 24.6s | |
| #132 | DeepSeek V4 Flash none | DeepSeek | 4.8 | 5.1 | 0/2 | 24.5s |
| #88 | Grok 4.1 Fast medium | X AI | 2.3 | 6.5 | 0/1 | 23.6s |
| #60 | GPT-5.2 medium | OpenAI | 10.0 | 7.3 | 2/2 | 23.2s |
| #5 | Qwen3.7 Max medium | Qwen | 10.0 | 9.0 | 2/2 | 23.0s |
| #80 | Grok Build 0.1 none | X AI | 10.0 | 6.6 | 1/1 | 21.4s |
| #67 | GPT-5.4 Nano medium | OpenAI | 6.8 | 7.2 | 1/2 | 21.1s |
| #109 | DeepSeek V3.2 none | DeepSeek | 3.1 | 5.7 | 0/2 | 20.9s |
| #101 | Owl Alpha medium | Openrouter | 6.6 | 5.8 | 1/2 | 19.1s |
| #16 | GPT-5.3-Codex medium | OpenAI | 10.0 | 8.3 | 2/2 | 18.5s |
| #77 | Gemma 4 31B none | 6.8 | 6.7 | 1/2 | 14.8s |