Kategoria ya AI BENCHY
Orodha ya Uchanganuzi na uchimbaji wa data
Ona ni modeli gani za AI zinafanya vizuri zaidi katika Uchanganuzi na uchimbaji wa data, zipi zinabaki thabiti, na pengo kubwa liko wapi. Panga kwa: Majaribio sahihi ↓.
Modeli zilizoonyeshwa
15
Wastani wa Alama ya Uchanganuzi na uchimbaji wa data
8.7
Modeli bora
Gemini 3 Flash Preview 10.0| Nafasi | Modeli | Kampuni | Alama ya Uchanganuzi na uchimbaji wa data | Alama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|
| #57 | Step 3.7 Flash low | Stepfun | 7.3 | 7.3 | 1/2 | 2.29s |
| #64 | MiMo-V2-Flash medium | Xiaomi | 6.5 | 7.2 | 1/2 | 0ms |
| #66 | Qwen3.5-35B-A3B medium | Qwen | 7.3 | 7.1 | 1/2 | 59.3s |
| #68 | Claude Opus 4.8 none | Anthropic | 7.3 | 7.0 | 1/2 | 1.77s |
| #75 | Ring-2.6-1T medium | Inclusionai | 6.5 | 6.9 | 1/2 | 37.4s |
| #81 | Mercury 2 medium | Inception | 7.3 | 6.6 | 1/2 | 1.11s |
| #82 | Hy3 preview high | Tencent | 6.5 | 6.6 | 1/2 | 12.1s |
| #89 | Hy3 preview low | Tencent | 6.5 | 6.4 | 1/2 | 5.85s |
| #99 | gpt-oss-120b medium | OpenAI | 6.4 | 6.1 | 1/2 | 1.98s |
| #103 | DeepSeek V4 Pro high | DeepSeek | 7.3 | 6.0 | 1/2 | 23.6s |
| #107 | Laguna Xs.2 medium | Poolside | 7.1 | 5.8 | 1/2 | 9.34s |
| #113 | DeepSeek V4 Pro none | DeepSeek | 6.9 | 5.7 | 1/2 | 30.5s |
| #118 | Qwen3.6 27B none | Qwen | 7.3 | 5.6 | 1/2 | 2.06s |
| #119 | Cobuddy medium | Baidu | 6.3 | 5.6 | 1/2 | 17.4s |
| #122 | GLM 4.7 Flash none | Z.ai | 7.3 | 5.5 | 1/2 | 4.82s |