Kategoria ya AI BENCHY
Orodha ya Uandishi wa msimbo
Ona ni modeli gani za AI zinafanya vizuri zaidi katika Uandishi wa msimbo, zipi zinabaki thabiti, na pengo kubwa liko wapi. Panga kwa: Muda wa majibu (wastani) ↑.
Modeli zilizoonyeshwa
15
Wastani wa Alama ya Uandishi wa msimbo
6.1
Modeli bora
Qwen3.6 Plus Preview 0.0| Nafasi | Modeli | Kampuni | Alama ya Uandishi wa msimbo | Alama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|
| #7 | Claude Opus 4.7 medium | Anthropic | 10.0 | 8.9 | 2/2 | 14.8s |
| #77 | Gemma 4 31B none | 6.8 | 6.7 | 1/2 | 14.8s | |
| #16 | GPT-5.3-Codex medium | OpenAI | 10.0 | 8.3 | 2/2 | 18.5s |
| #101 | Owl Alpha medium | Openrouter | 6.6 | 5.8 | 1/2 | 19.1s |
| #109 | DeepSeek V3.2 none | DeepSeek | 3.1 | 5.7 | 0/2 | 20.9s |
| #67 | GPT-5.4 Nano medium | OpenAI | 6.8 | 7.2 | 1/2 | 21.1s |
| #80 | Grok Build 0.1 none | X AI | 10.0 | 6.6 | 1/1 | 21.4s |
| #5 | Qwen3.7 Max medium | Qwen | 10.0 | 9.0 | 2/2 | 23.0s |
| #60 | GPT-5.2 medium | OpenAI | 10.0 | 7.3 | 2/2 | 23.2s |
| #88 | Grok 4.1 Fast medium | X AI | 2.3 | 6.5 | 0/1 | 23.6s |
| #132 | DeepSeek V4 Flash none | DeepSeek | 4.8 | 5.1 | 0/2 | 24.5s |
| #2 | Gemini 3.5 Flash high | 10.0 | 9.6 | 2/2 | 24.6s | |
| #84 | Grok 4.20 Multi Agent Beta medium | X AI | 10.0 | 6.6 | 1/1 | 27.1s |
| #39 | Hy3 preview low | Tencent | 10.0 | 7.7 | 1/1 | 27.9s |
| #66 | Claude Opus 4.6 medium | Anthropic | 7.2 | 7.2 | 1/2 | 29.4s |