Kategoria ya AI BENCHY
Orodha ya Uchanganuzi na uchimbaji wa data
Ona ni modeli gani za AI zinafanya vizuri zaidi katika Uchanganuzi na uchimbaji wa data, zipi zinabaki thabiti, na pengo kubwa liko wapi. Panga kwa: Muda wa majibu (wastani) ↓.
Modeli zilizoonyeshwa
13
Wastani wa Alama ya Uchanganuzi na uchimbaji wa data
8.7
Modeli bora
Qwen3.5-9B 3.6| Nafasi | Modeli | Kampuni | Alama ya Uchanganuzi na uchimbaji wa data | Alama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|
| #157 | Grok 4.1 Fast none | X AI | 10.0 | 4.4 | 2/2 | 943ms |
| #154 | Qwen3.5-9B none | Qwen | 10.0 | 4.6 | 2/2 | 847ms |
| #90 | Gemini 3.1 Flash Lite none | 10.0 | 6.4 | 2/2 | 843ms | |
| #142 | Mistral Small 4 none | Mistral | 10.0 | 4.9 | 2/2 | 822ms |
| #160 | LFM2-24B-A2B none | Liquid | 3.0 | 4.2 | 0/2 | 714ms |
| #155 | Mercury 2 none | Inception | 7.3 | 4.5 | 1/2 | 667ms |
| #97 | Gemini 2.5 Flash none | 10.0 | 6.2 | 2/2 | 652ms | |
| #146 | Laguna Xs.2 none | Poolside | 10.0 | 4.8 | 2/2 | 646ms |
| #106 | Grok 4.20 Beta none | X AI | 10.0 | 5.8 | 2/2 | 601ms |
| #163 | Granite 4.1 8B none | IBM Granite | 3.0 | 4.0 | 0/2 | 575ms |
| #127 | Grok 4.20 none | X AI | 10.0 | 5.4 | 2/2 | 522ms |
| #64 | MiMo-V2-Flash medium | Xiaomi | 6.5 | 7.2 | 1/2 | 0ms |
| #83 | Step 3.5 Flash none | Stepfun | 3.0 | 6.6 | 0/1 | 0ms |