Kategoria ya AI BENCHY
Orodha ya Uandishi wa msimbo
Ona ni modeli gani za AI zinafanya vizuri zaidi katika Uandishi wa msimbo, zipi zinabaki thabiti, na pengo kubwa liko wapi.
Modeli zilizoonyeshwa
15
Wastani wa Alama ya Uandishi wa msimbo
6.1
Modeli bora
Gemini 3.5 Flash 10.0| Nafasi | Modeli | Kampuni | Alama ya Uandishi wa msimbo | Alama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|
| #126 | Nemotron 3 Nano Omni 30b A3b Reasoning medium | NVIDIA | 3.3 | 5.4 | 0/1 | 38.1s |
| #139 | GPT-4o-mini none | OpenAI | 3.2 | 4.9 | 0/2 | 2.05s |
| #109 | DeepSeek V3.2 none | DeepSeek | 3.1 | 5.7 | 0/2 | 20.9s |
| #96 | Nemotron 3 Super medium | NVIDIA | 3.1 | 5.9 | 0/2 | 62.4s |
| #20 | Gemini 3 PRO Preview medium | 3.0 | 8.1 | 0/2 | 0ms | |
| #34 | Step 3.5 Flash none | Stepfun | 3.0 | 7.8 | 0/1 | 0ms |
| #76 | Hunter Alpha medium | OpenRouter | 3.0 | 6.7 | 0/1 | 0ms |
| #112 | Hunter Alpha none | OpenRouter | 3.0 | 5.7 | 0/1 | 0ms |
| #58 | Step 3.5 Flash medium | Stepfun | 3.0 | 7.4 | 0/1 | 62.8s |
| #31 | Gemma 4 26B A4B medium | 2.9 | 7.8 | 0/2 | 258.4s | |
| #151 | Qwen3.5-9B medium | Qwen | 2.8 | 4.2 | 0/2 | 135.6s |
| #83 | DeepSeek V4 Pro high | DeepSeek | 2.8 | 6.6 | 0/2 | 51.8s |
| #129 | Laguna Xs.2 none | Poolside | 2.5 | 5.3 | 0/1 | 1.96s |
| #88 | Grok 4.1 Fast medium | X AI | 2.3 | 6.5 | 0/1 | 23.6s |
| #147 | Hy3 preview none | Tencent | 2.3 | 4.6 | 0/1 | 4.56s |