Kategoria ya AI BENCHY
Orodha ya Mchanganyiko
Ona ni modeli gani za AI zinafanya vizuri zaidi katika Mchanganyiko, zipi zinabaki thabiti, na pengo kubwa liko wapi. Panga kwa: Muda wa majibu (wastani) ↑.
| Nafasi | Modeli | Kampuni | Alama ya Mchanganyiko | Alama | Majaribio sahihi | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|
| #58 | GLM 5V Turbo none | Z.ai | 3.0 | 6.2 | 0/1 | 6.51s |
| #65 | MiMo-V2-Pro none | Xiaomi | 3.0 | 6.0 | 0/1 | 6.58s |
| #61 | Seed-2.0-Lite none | Bytedance Seed | 3.0 | 6.2 | 0/1 | 6.59s |
| #49 | Qwen3.5 Plus 2026-02-15 none | Qwen | 3.0 | 6.8 | 0/1 | 6.65s |
| #89 | GPT-4o-mini none | OpenAI | 3.0 | 4.9 | 0/1 | 7.58s |
| #78 | Trinity Large Preview none | Arcee AI | 3.0 | 5.3 | 0/1 | 8.91s |
| #28 | GPT-5.2 Chat none | OpenAI | 10.0 | 7.9 | 1/1 | 9.12s |
| #67 | Qwen3.5-27B none | Qwen | 2.8 | 5.9 | 0/1 | 9.39s |
| #12 | Gemini 3 PRO Preview medium | 3.0 | 8.4 | 0/1 | 10.4s | |
| #22 | Gemini 3.1 Flash Lite Preview low | 3.0 | 8.1 | 0/1 | 11.9s | |
| #36 | GPT-5.3 Chat none | OpenAI | 10.0 | 7.7 | 1/1 | 12.0s |
| #18 | GLM 5 Turbo medium | Z.ai | 10.0 | 8.1 | 1/1 | 13.9s |
| #40 | GPT-5.2 medium | OpenAI | 10.0 | 7.5 | 1/1 | 14.1s |
| #17 | Gemini 3.1 Flash Lite Preview medium | 10.0 | 8.2 | 1/1 | 14.9s | |
| #31 | GLM 5V Turbo medium | Z.ai | 6.9 | 7.8 | 0/1 | 15.1s |