AI BENCHY Category
Coding Leaderboard
See which AI models perform best at coding, which stay consistent, and where the biggest gaps are. Sort by: Response time (average) ↓.
| Rank | Model | Company | Coding score | Score | Correct tests | Response time (average) |
|---|---|---|---|---|---|---|
| #114 | GPT-5.4 none | OpenAI | 6.8 | 5.6 | 1/2 | 1.99s |
| #106 | Qwen3.5-27B none | Qwen | 7.3 | 5.8 | 1/2 | 1.98s |
| #129 | Laguna Xs.2 none | Poolside | 2.5 | 5.3 | 0/1 | 1.96s |
| #115 | MiMo-V2.5-Pro none | Xiaomi | 5.0 | 5.6 | 0/2 | 1.80s |
| #150 | Grok 4.1 Fast none | X AI | 5.3 | 4.4 | 0/1 | 1.79s |
| #104 | Qwen3.5-35B-A3B none | Qwen | 6.8 | 5.8 | 1/2 | 1.72s |
| #53 | Gemini 3.1 Flash Lite low | Google | 6.8 | 7.4 | 1/2 | 1.71s |
| #46 | Gemini 3.1 Flash Lite Preview low | Google | 6.8 | 7.6 | 1/2 | 1.56s |
| #86 | GPT-5.5 none | OpenAI | 6.8 | 6.5 | 1/2 | 1.52s |
| #131 | Elephant Alpha none | Openrouter | 4.7 | 5.2 | 0/2 | 1.39s |
| #27 | Qwen3.7 Max none | Qwen | 6.8 | 7.9 | 1/2 | 1.39s |
| #122 | Elephant Alpha medium | Openrouter | 4.0 | 5.4 | 0/2 | 1.30s |
| #145 | Nemotron 3 Nano Omni 30b A3b Reasoning none | NVIDIA | 10.0 | 4.6 | 1/1 | 1.27s |
| #120 | Grok 4.20 none | X AI | 3.4 | 5.4 | 0/1 | 1.22s |
| #142 | Qwen3 Coder Next medium | Qwen | 4.1 | 4.7 | 0/2 | 1.17s |