AI BENCHY Category
Anti-AI Tricks Leaderboard
See which AI models perform best on Anti-AI Tricks, which stay consistent, and where the biggest gaps are. Sorted by: Correct tests ↑.
| Rank | Model | Company | Anti-AI Tricks score | Score | Correct tests | Response time (avg) |
|---|---|---|---|---|---|---|
| #59 | Qwen3.5-Flash none | Qwen | 3.5 | 6.2 | 0/4 | 1.32s |
| #61 | Seed-2.0-Lite none | Bytedance Seed | 3.0 | 6.2 | 0/4 | 2.43s |
| #62 | Gemini 2.5 Flash none | Google | 3.0 | 6.2 | 0/4 | 582ms |
| #63 | Qwen3.5-35B-A3B none | Qwen | 3.4 | 6.1 | 0/4 | 1.43s |
| #64 | DeepSeek V3.2 none | DeepSeek | 3.2 | 6.1 | 0/4 | 7.63s |
| #65 | MiMo-V2-Pro none | Xiaomi | 3.5 | 6.0 | 0/4 | 1.80s |
| #66 | GPT-5.4 none | OpenAI | 3.2 | 5.9 | 0/4 | 1.21s |
| #72 | Hunter Alpha none | OpenRouter | 3.5 | 5.7 | 0/4 | 3.81s |
| #75 | GLM 5.1 none | Z.ai | 4.0 | 5.6 | 0/4 | 2.11s |
| #76 | Kimi K2.5 none | Moonshot AI | 3.6 | 5.5 | 0/4 | 6.24s |
| #77 | GLM 5 Turbo none | Z.ai | 3.0 | 5.5 | 0/4 | 2.84s |
| #78 | Trinity Large Preview none | Arcee AI | 3.0 | 5.3 | 0/4 | 3.02s |
| #79 | Grok 4.20 Beta none | X AI | 4.0 | 5.3 | 0/4 | 597ms |
| #83 | Mistral Small 4 none | Mistral | 3.4 | 5.2 | 0/4 | 395ms |
| #86 | GPT-5.4 Mini none | OpenAI | 3.1 | 5.1 | 0/4 | 929ms |