Kategoria ya AI BENCHY
Orodha ya Akili ya jumla
Ona ni modeli gani za AI zinafanya vizuri zaidi katika Akili ya jumla, zipi zinabaki thabiti, na pengo kubwa liko wapi. Panga kwa: Majaribio sahihi ↑.
| Nafasi | Modeli | Kampuni | Alama ya Akili ya jumla | Alama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|
| #147 | GPT-4o-mini none | OpenAI | 4.0 | 4.8 | 0/1 | 909ms |
| #148 | GPT-5.4 Nano none | OpenAI | 3.8 | 4.7 | 0/1 | 1.31s |
| #149 | Nemotron 3 Nano Omni 30b A3b Reasoning medium | NVIDIA | 3.0 | 4.6 | 0/1 | 0ms |
| #150 | Qwen3 Coder Next medium | Qwen | 6.3 | 4.6 | 0/1 | 1.39s |
| #151 | Trinity Large Preview none | Arcee AI | 4.5 | 4.6 | 0/1 | 873ms |
| #152 | MiMo-V2-Flash none | Xiaomi | 4.6 | 4.6 | 0/1 | 1.67s |
| #153 | Qwen3.6 35B A3B none | Qwen | 4.4 | 4.6 | 0/1 | 3.51s |
| #154 | Qwen3.5-9B none | Qwen | 4.4 | 4.6 | 0/1 | 552ms |
| #155 | Mercury 2 none | Inception | 4.8 | 4.5 | 0/1 | 628ms |
| #156 | Hy3 preview none | Tencent | 4.1 | 4.4 | 0/1 | 16.1s |
| #157 | Grok 4.1 Fast none | X AI | 4.4 | 4.4 | 0/1 | 1.08s |
| #158 | GLM 4.7 Flash medium | Z.ai | 3.6 | 4.4 | 0/1 | 18.1s |
| #159 | Ling-2.6-1T none | Inclusionai | 5.0 | 4.3 | 0/1 | 20.3s |
| #160 | LFM2-24B-A2B none | Liquid | 4.0 | 4.2 | 0/1 | 395ms |
| #161 | Qwen3.5-9B medium | Qwen | 2.8 | 4.2 | 0/1 | 226.4s |