AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

Kategori AI BENCHY

Peringkat Pengetahuan umum

Lihat model AI mana yang paling baik di Pengetahuan umum, mana yang tetap andal, dan di mana kesenjangan terbesar muncul. Urutkan berdasarkan: Waktu respons (rata-rata) ↑.

Model yang ditampilkan

15

Rata-rata Skor Pengetahuan umum

2.9

Peringkat Model Perusahaan Skor Pengetahuan umum Skor Tes benar Waktu respons (rata-rata)
#45 Qwen3.5-Flash medium Qwen 3.0 7.6 0/1 49.0s
#31 Qwen3.5-122B-A10B medium Qwen 3.0 7.9 0/1 52.9s
#48 DeepSeek V4 Flash high DeepSeek 3.0 7.6 0/1 54.5s
#85 Nemotron 3 Super medium NVIDIA 3.0 6.1 0/1 55.3s
#56 Seed-2.0-Mini medium Bytedance Seed 3.0 7.2 0/1 56.8s
#9 Qwen3.6 Max Preview medium Qwen 3.0 8.5 0/1 60.6s
#52 Claude Opus 4.6 medium Anthropic 3.0 7.4 0/1 63.2s
#66 Grok 4.20 medium X AI 3.0 6.9 0/1 63.5s
#19 GLM 5 medium Z.ai 3.0 8.1 0/1 67.4s
#106 MiniMax M2.5 medium Minimax 3.0 5.5 0/1 80.8s
#70 Qwen3.6 27B medium Qwen 3.0 6.8 0/1 81.0s
#37 MiMo-V2-Pro medium Xiaomi 3.0 7.7 0/1 82.7s
#69 Kimi K2.5 medium Moonshot AI 3.0 6.8 0/1 83.9s
#59 DeepSeek V3.2 medium DeepSeek 3.0 7.2 0/1 84.0s
#17 Qwen3.5-27B medium Qwen 3.0 8.1 0/1 85.1s

Model teratas menurut Skor Pengetahuan umum

Skor Pengetahuan umum vs total biaya

Model teratas menurut Waktu respons (rata-rata)