AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

Kategori AI BENCHY

Peringkat Spesifik domain

Lihat model AI mana yang paling baik di Spesifik domain, mana yang tetap andal, dan di mana kesenjangan terbesar muncul. Urutkan berdasarkan: Waktu respons (rata-rata) ↑.

Model yang ditampilkan

15

Rata-rata Skor Spesifik domain

4.8

Model terbaik

GLM 5 3.5
Peringkat Model Perusahaan Skor Spesifik domain Skor Tes benar Waktu respons (rata-rata)
#17 GLM 5 medium Z.ai 3.5 8.3 0/3 0ms
#52 Claude Sonnet 4.6 medium Anthropic 2.9 7.4 0/3 0ms
#73 Seed-2.0-Mini medium Bytedance Seed 3.0 6.9 0/3 0ms
#160 LFM2-24B-A2B none Liquid 5.9 4.2 1/3 287ms
#163 Granite 4.1 8B none IBM Granite 3.0 4.0 0/3 357ms
#142 Mistral Small 4 none Mistral 5.3 4.9 1/3 367ms
#146 Laguna Xs.2 none Poolside 5.3 4.8 1/3 371ms
#154 Qwen3.5-9B none Qwen 3.0 4.6 0/3 464ms
#131 Qwen3.5-122B-A10B none Qwen 5.3 5.3 1/3 465ms
#117 Qwen3.5-35B-A3B none Qwen 7.7 5.6 2/3 485ms
#162 Nemotron 3 Nano Omni 30b A3b Reasoning none NVIDIA 3.6 4.1 0/3 489ms
#97 Gemini 2.5 Flash none Google 5.9 6.2 1/3 495ms
#155 Mercury 2 none Inception 5.3 4.5 1/3 534ms
#115 Qwen3.5-27B none Qwen 3.0 5.7 0/3 540ms
#152 MiMo-V2-Flash none Xiaomi 5.3 4.6 1/3 564ms

Model teratas menurut Skor Spesifik domain

Skor Spesifik domain vs total biaya

Model teratas menurut Waktu respons (rata-rata)