AI BENCHY category
General Intelligence leaderboard
See which AI models perform best in General Intelligence, which stay consistent, and where the biggest gaps are. Sort by: Metric ↑.
| Rank | Model | Company | General Intelligence score | Score | Correct tests | Response time (average) |
|---|---|---|---|---|---|---|
| #19 | Seed-2.0-Lite medium | Bytedance Seed | 6.7 | 8.2 | 0/1 | 18.2s |
| #79 | Hunter Alpha medium | OpenRouter | 7.0 | 6.7 | 0/1 | 6.44s |
| #1 | Gemini 3 Flash Preview medium | Google | 10.0 | 9.8 | 1/1 | 5.19s |
| #2 | Gemini 3.5 Flash high | Google | 10.0 | 9.6 | 1/1 | 3.63s |
| #3 | Gemini 3.5 Flash low | Google | 10.0 | 9.4 | 1/1 | 2.27s |
| #4 | Gemini 3.1 Pro Preview medium | Google | 10.0 | 9.4 | 1/1 | 11.8s |
| #5 | Qwen3.7 Max medium | Qwen | 10.0 | 9.1 | 1/1 | 11.7s |
| #6 | GPT-5.5 low | OpenAI | 10.0 | 9.0 | 1/1 | 5.17s |
| #7 | Gemini 3.5 Flash medium | Google | 10.0 | 9.0 | 1/1 | 2.52s |
| #8 | Claude Opus 4.7 none | Anthropic | 10.0 | 8.9 | 1/1 | 3.47s |
| #9 | GPT-5.5 medium | OpenAI | 10.0 | 8.8 | 1/1 | 4.16s |
| #10 | Claude Opus 4.8 medium | Anthropic | 10.0 | 8.7 | 1/1 | 2.46s |
| #11 | Claude Opus 4.7 medium | Anthropic | 10.0 | 8.7 | 1/1 | 2.87s |
| #12 | Gemini 3.1 Flash Lite Preview high | Google | 10.0 | 8.6 | 1/1 | 5.25s |
| #13 | Grok 4.20 Beta medium | X AI | 10.0 | 8.5 | 1/1 | 5.78s |