AI BENCHY Category
Coding Leaderboard
See which AI models perform best at coding, which stay consistent, and where the biggest gaps are.
Models shown
15
Average Coding Score
6.1
Top model
Gemini 3.5 Flash 10.0

| Rank | Model | Company | Coding Score | Score | Correct tests | Response time (avg) |
|---|---|---|---|---|---|---|
| #68 | GPT-5 Mini medium | OpenAI | 10.0 | 7.2 | 2/2 | 30.7s |
| #80 | Grok Build 0.1 none | X AI | 10.0 | 6.6 | 1/1 | 21.4s |
| #84 | Grok 4.20 Multi Agent Beta medium | X AI | 10.0 | 6.6 | 1/1 | 27.1s |
| #130 | Ling-2.6-flash none | Inclusionai | 10.0 | 5.3 | 1/1 | 11.2s |
| #145 | Nemotron 3 Nano Omni 30b A3b Reasoning none | NVIDIA | 10.0 | 4.6 | 1/1 | 1.27s |
| #9 | Gemini 3.5 Flash none | | 8.2 | 8.9 | 1/2 | 39.6s |
| #11 | GPT-5.5 medium | OpenAI | 8.2 | 8.7 | 1/2 | 69.7s |
| #15 | Qwen3.6 Max Preview medium | Qwen | 8.2 | 8.4 | 1/2 | 178.0s |
| #28 | GPT-5.4 medium | OpenAI | 8.2 | 7.9 | 1/2 | 55.0s |
| #30 | GPT-5.2 Chat none | OpenAI | 8.2 | 7.9 | 1/2 | 8.05s |
| #1 | Gemini 3 Flash Preview medium | | 7.9 | 9.8 | 1/2 | 96.0s |
| #21 | Qwen3.5 Plus 2026-02-15 medium | Qwen | 7.6 | 8.1 | 1/2 | 193.8s |
| #49 | MiMo-V2-Pro medium | Xiaomi | 7.5 | 7.6 | 1/2 | 94.2s |
| #61 | GPT-5.4 Mini medium | OpenAI | 7.5 | 7.3 | 1/2 | 73.3s |
| #124 | Laguna M.1 none | Poolside | 7.5 | 5.4 | 0/1 | 2.93s |