AI BENCHY category
General knowledge leaderboard
See which AI models perform best in General knowledge, which stay consistent, and where the biggest gap is. Sort by: Correct attempts ↑.
| Rank | Model | Company | General knowledge score | Score | Total cost | Correct attempts | Response time (average) |
|---|---|---|---|---|---|---|---|
| #82 | Gemini 3.1 Flash Lite Preview low | | 3.0 | 6.5 | $0.026 | 0/1 | 1.35s |
| #83 | Gemini 3.1 Flash Lite high | | 0.0 | 6.5 | $2.044 | 0/0 | 0ms |
| #84 | Gemini 3.1 Flash Lite Preview none | | 3.0 | 6.4 | $0.018 | 0/1 | 814ms |
| #85 | Gemini 3.1 Flash Lite low | | 3.0 | 6.4 | $0.028 | 0/1 | 1.46s |
| #86 | Hy3 preview low | Tencent | 3.0 | 6.4 | $0.018 | 0/1 | 41.7s |
| #87 | Nemotron 3 Super medium | NVIDIA | 3.0 | 6.3 | $0.021 | 0/1 | 55.3s |
| #88 | Gemma 4 31B medium | | 3.0 | 6.3 | $0.033 | 0/1 | 90.1s |
| #89 | Qwen3.5-35B-A3B medium | Qwen | 3.0 | 6.3 | $0.401 | 0/1 | 177.4s |
| #90 | GPT-5.5 none | OpenAI | 3.0 | 6.3 | $0.231 | 0/1 | 5.01s |
| #91 | Gemini 3 PRO Preview medium | | 3.0 | 6.2 | $0.385 | 0/1 | 0ms |
| #92 | Seed-2.0-Lite none | Bytedance Seed | 3.0 | 6.2 | $0.019 | 0/1 | 1.96s |
| #93 | Gemini 2.5 Flash none | | 3.0 | 6.2 | $0.016 | 0/1 | 1.15s |
| #94 | Gemini 3.1 Flash Lite minimal | | 3.0 | 6.1 | $0.013 | 0/1 | 724ms |
| #95 | Gemini 3.1 Flash Lite Preview high | | 0.0 | 6.1 | $2.310 | 0/0 | 0ms |
| #96 | Gemini 3.1 Flash Lite none | | 3.0 | 6.1 | $0.013 | 0/1 | 733ms |