AI BENCHY Category
Coding Leaderboard
See which AI models perform best at coding, which stay consistent, and where the biggest gaps are.
Models shown
15
Average coding score
5.7
Top model
Gemini 3.5 Flash 10.0

| Rank | Model | Company | Coding score | Score | Correct tests | Response time (avg) |
|---|---|---|---|---|---|---|
| #2 | Gemini 3.5 Flash high | | 10.0 | 9.6 | 3/3 | 23.0s |
| #5 | Qwen3.7 Max medium | Qwen | 10.0 | 9.1 | 3/3 | 35.3s |
| #6 | GPT-5.5 low | OpenAI | 10.0 | 9.0 | 3/3 | 15.0s |
| #8 | Claude Opus 4.7 none | Anthropic | 10.0 | 8.9 | 1/1 | 2.84s |
| #10 | Claude Opus 4.8 medium | Anthropic | 10.0 | 8.7 | 3/3 | 15.3s |
| #13 | Grok 4.20 Beta medium | X AI | 10.0 | 8.5 | 1/1 | 31.4s |
| #15 | GPT-5.3-Codex medium | OpenAI | 10.0 | 8.4 | 3/3 | 19.5s |
| #17 | GLM 5 medium | Z.ai | 10.0 | 8.3 | 3/3 | 74.3s |
| #42 | GPT-5.2 medium | OpenAI | 10.0 | 7.5 | 3/3 | 22.7s |
| #53 | Gemini 3.1 Flash Lite high | | 10.0 | 7.3 | 1/1 | 137.6s |
| #54 | GPT-5 Mini medium | OpenAI | 10.0 | 7.3 | 3/3 | 27.6s |
| #84 | Grok 4.20 Multi Agent Beta medium | X AI | 10.0 | 6.6 | 1/1 | 27.1s |
| #100 | Grok Build 0.1 none | X AI | 10.0 | 6.0 | 1/1 | 21.4s |
| #162 | Nemotron 3 Nano Omni 30b A3b Reasoning none | NVIDIA | 10.0 | 4.1 | 1/1 | 1.27s |
| #9 | GPT-5.5 medium | OpenAI | 8.8 | 8.8 | 2/3 | 59.8s |