Kategoria ya AI BENCHY
Orodha ya Maarifa ya jumla
Ona ni modeli gani za AI zinafanya vizuri zaidi katika Maarifa ya jumla, zipi zinabaki thabiti, na pengo kubwa liko wapi. Panga kwa: Jumla ya gharama ↓.
Modeli zilizoonyeshwa
15
Wastani wa Alama ya Maarifa ya jumla
3.1
Modeli bora
Grok 4.20 Multi Agent Beta 0.0
169/169
Chuja miundo
Hakuna miundo inayolingana na utafutaji na vichujio vya sasa.
| Nafasi | Modeli | Kampuni | Alama ya Maarifa ya jumla | Alama | Jumla ya gharama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|---|
| #148 | Qwen3 Coder Next medium | Qwen | 3.0 | 4.7 | $0.008 | 0/1 | 399ms |
| #137 | Trinity Large Preview none | Arcee AI | 3.0 | 5.0 | $0.008 | 0/1 | 777ms |
| #161 | Grok 4.1 Fast none | X AI | 3.0 | 4.0 | $0.008 | 0/1 | 731ms |
| #117 | DeepSeek V4 Flash none | DeepSeek | 3.0 | 5.5 | $0.007 | 0/1 | 3.07s |
| #129 | Mistral Small 4 none | Mistral | 3.0 | 5.1 | $0.007 | 0/1 | 397ms |
| #134 | MiMo-V2.5 none | Xiaomi | 3.0 | 5.1 | $0.007 | 0/1 | 3.89s |
| #142 | Nemotron 3 Super none | NVIDIA | 3.0 | 4.9 | $0.007 | 0/1 | 8.94s |
| #139 | GPT-4o-mini none | OpenAI | 3.0 | 5.0 | $0.006 | 0/1 | 794ms |
| #135 | Qwen3.5-9B none | Qwen | 3.0 | 5.1 | $0.006 | 0/1 | 2.32s |
| #147 | Ling-2.6-1T none | Inclusionai | 3.0 | 4.7 | $0.005 | 0/1 | 0ms |
| #97 | Qwen3.5-Flash none | Qwen | 3.0 | 6.1 | $0.005 | 0/1 | 588ms |
| #141 | GLM 4.7 Flash none | Z.ai | 3.0 | 4.9 | $0.004 | 0/1 | 692ms |
| #121 | Gemma 4 26B A4B none | 3.0 | 5.5 | $0.004 | 0/1 | 778ms | |
| #98 | Gemma 4 31B none | 3.0 | 6.1 | $0.004 | 0/1 | 1.25s | |
| #163 | Granite 4.1 8B none | IBM Granite | 3.0 | 4.0 | $0.003 | 0/1 | 306ms |