Kategoria ya AI BENCHY
Orodha ya Uchanganuzi na uchimbaji wa data
Ona ni modeli gani za AI zinafanya vizuri zaidi katika Uchanganuzi na uchimbaji wa data, zipi zinabaki thabiti, na pengo kubwa liko wapi.
Modeli zilizoonyeshwa
15
Wastani wa Alama ya Uchanganuzi na uchimbaji wa data
8.7
Modeli bora
DeepSeek V4 Flash 10.0| Nafasi | Modeli | Kampuni | Alama ya Uchanganuzi na uchimbaji wa data | Alama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|
| #31 | DeepSeek V4 Flash high | DeepSeek | 10.0 | 7.7 | 2/2 | 28.0s |
| #62 | Step 3.5 Flash medium | Stepfun | 10.0 | 7.2 | 2/2 | 15.0s |
| #139 | DeepSeek V4 Flash none | DeepSeek | 10.0 | 5.0 | 2/2 | 23.8s |
| #1 | Gemini 3 Flash Preview medium | 10.0 | 9.8 | 2/2 | 5.43s | |
| #2 | Gemini 3.5 Flash high | 10.0 | 9.6 | 2/2 | 6.43s | |
| #3 | Gemini 3.5 Flash low | 10.0 | 9.4 | 2/2 | 1.81s | |
| #4 | Gemini 3.1 Pro Preview medium | 10.0 | 9.4 | 2/2 | 7.72s | |
| #5 | Qwen3.7 Max medium | Qwen | 10.0 | 9.1 | 2/2 | 8.80s |
| #6 | GPT-5.5 low | OpenAI | 10.0 | 9.0 | 2/2 | 3.28s |
| #7 | Gemini 3.5 Flash medium | 10.0 | 9.0 | 2/2 | 4.07s | |
| #8 | Claude Opus 4.7 none | Anthropic | 10.0 | 8.9 | 2/2 | 2.15s |
| #9 | GPT-5.5 medium | OpenAI | 10.0 | 8.8 | 2/2 | 4.18s |
| #11 | Claude Opus 4.7 medium | Anthropic | 10.0 | 8.7 | 2/2 | 2.37s |
| #12 | Gemini 3.1 Flash Lite Preview high | 10.0 | 8.6 | 2/2 | 7.16s | |
| #13 | Grok 4.20 Beta medium | X AI | 10.0 | 8.5 | 2/2 | 4.01s |