AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

Kategori AI BENCHY

Peringkat Spesifik domain

Lihat model AI mana yang paling baik di Spesifik domain, mana yang tetap andal, dan di mana kesenjangan terbesar muncul. Urutkan berdasarkan: Waktu respons (rata-rata) ↓.

Model yang ditampilkan

13

Rata-rata Skor Spesifik domain

4.8

Model terbaik

MiniMax M2.5 2.9
Peringkat Model Perusahaan Skor Spesifik domain Skor Tes benar Waktu respons (rata-rata)
#155 Mercury 2 none Inception 5.3 4.5 1/3 534ms
#97 Gemini 2.5 Flash none Google 5.9 6.2 1/3 495ms
#162 Nemotron 3 Nano Omni 30b A3b Reasoning none NVIDIA 3.6 4.1 0/3 489ms
#117 Qwen3.5-35B-A3B none Qwen 7.7 5.6 2/3 485ms
#131 Qwen3.5-122B-A10B none Qwen 5.3 5.3 1/3 465ms
#154 Qwen3.5-9B none Qwen 3.0 4.6 0/3 464ms
#146 Laguna Xs.2 none Poolside 5.3 4.8 1/3 371ms
#142 Mistral Small 4 none Mistral 5.3 4.9 1/3 367ms
#163 Granite 4.1 8B none IBM Granite 3.0 4.0 0/3 357ms
#160 LFM2-24B-A2B none Liquid 5.9 4.2 1/3 287ms
#17 GLM 5 medium Z.ai 3.5 8.3 0/3 0ms
#52 Claude Sonnet 4.6 medium Anthropic 2.9 7.4 0/3 0ms
#73 Seed-2.0-Mini medium Bytedance Seed 3.0 6.9 0/3 0ms

Model teratas menurut Skor Spesifik domain

Skor Spesifik domain vs total biaya

Model teratas menurut Waktu respons (rata-rata)