Kategoria ya AI BENCHY
Orodha ya Uchanganuzi na uchimbaji wa data
Ona ni modeli gani za AI zinafanya vizuri zaidi katika Uchanganuzi na uchimbaji wa data, zipi zinabaki thabiti, na pengo kubwa liko wapi. Panga kwa: Jumla ya gharama ↓.
Modeli zilizoonyeshwa
15
Wastani wa Alama ya Uchanganuzi na uchimbaji wa data
8.8
Modeli bora
Grok 4.20 Multi Agent Beta 10.0
169/169
Chuja miundo
Hakuna miundo inayolingana na utafutaji na vichujio vya sasa.
| Nafasi | Modeli | Kampuni | Alama ya Uchanganuzi na uchimbaji wa data | Alama | Jumla ya gharama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|---|
| #29 | Qwen3.5-27B medium | Qwen | 10.0 | 7.9 | $0.536 | 2/2 | 30.3s |
| #27 | GPT-5.4 Mini medium | OpenAI | 10.0 | 8.0 | $0.526 | 2/2 | 2.43s |
| #3 | Qwen3.7 Max medium | Qwen | 10.0 | 9.4 | $0.523 | 2/2 | 8.80s |
| #49 | Claude Opus 4.7 none | Anthropic | 10.0 | 7.4 | $0.505 | 2/2 | 2.15s |
| #56 | GLM 5V Turbo medium | Z.ai | 10.0 | 7.3 | $0.457 | 2/2 | 9.60s |
| #81 | Qwen3.6 27B medium | Qwen | 3.5 | 6.6 | $0.440 | 0/2 | 37.3s |
| #45 | GPT-5.3 Chat none | OpenAI | 10.0 | 7.5 | $0.433 | 2/2 | 2.21s |
| #89 | Qwen3.5-35B-A3B medium | Qwen | 7.3 | 6.3 | $0.401 | 1/2 | 59.3s |
| #19 | GPT-5.2 Chat none | OpenAI | 10.0 | 8.5 | $0.393 | 2/2 | 3.05s |
| #91 | Gemini 3 PRO Preview medium | 10.0 | 6.2 | $0.385 | 2/2 | 10.8s | |
| #24 | Gemini 2.5 Flash medium | 10.0 | 8.2 | $0.379 | 2/2 | 4.06s | |
| #20 | Step 3.7 Flash medium | Stepfun | 10.0 | 8.5 | $0.376 | 2/2 | 2.75s |
| #5 | Gemini 3.5 Flash low | 10.0 | 9.2 | $0.349 | 2/2 | 1.81s | |
| #43 | Kimi K2.5 medium | Moonshot AI | 10.0 | 7.5 | $0.348 | 2/2 | 49.8s |
| #39 | Step 3.7 Flash low | Stepfun | 7.3 | 7.7 | $0.341 | 1/2 | 2.29s |