Kategoria ya AI BENCHY
Orodha ya Akili ya jumla
Ona ni modeli gani za AI zinafanya vizuri zaidi katika Akili ya jumla, zipi zinabaki thabiti, na pengo kubwa liko wapi. Panga kwa: Muda wa majibu (wastani) ↑.
| Nafasi | Modeli | Kampuni | Alama ya Akili ya jumla | Alama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|
| #34 | Qwen3.7 Max none | Qwen | 10.0 | 7.7 | 1/1 | 1.04s |
| #118 | Qwen3.6 27B none | Qwen | 5.2 | 5.6 | 0/1 | 1.07s |
| #157 | Grok 4.1 Fast none | X AI | 4.4 | 4.4 | 0/1 | 1.08s |
| #131 | Qwen3.5-122B-A10B none | Qwen | 5.0 | 5.3 | 0/1 | 1.12s |
| #48 | Gemini 3 Flash Preview none | 10.0 | 7.4 | 1/1 | 1.13s | |
| #117 | Qwen3.5-35B-A3B none | Qwen | 6.5 | 5.6 | 0/1 | 1.19s |
| #148 | GPT-5.4 Nano none | OpenAI | 3.8 | 4.7 | 0/1 | 1.31s |
| #88 | Qwen3.7 Plus none | Qwen | 5.3 | 6.4 | 0/1 | 1.33s |
| #140 | Qwen3 Coder Next none | Qwen | 10.0 | 4.9 | 1/1 | 1.34s |
| #61 | Gemini 3.1 Flash Lite low | 4.0 | 7.2 | 0/1 | 1.37s | |
| #150 | Qwen3 Coder Next medium | Qwen | 6.3 | 4.6 | 0/1 | 1.39s |
| #114 | Qwen3.5 Plus 2026-04-20 none | Qwen | 4.8 | 5.7 | 0/1 | 1.41s |
| #138 | Ling-2.6-flash none | Inclusionai | 4.0 | 5.0 | 0/1 | 1.45s |
| #50 | Gemini 3.1 Flash Lite Preview low | 4.0 | 7.4 | 0/1 | 1.54s | |
| #124 | Kimi K2.6 none | Moonshot AI | 5.4 | 5.5 | 0/1 | 1.55s |