AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

Kategori AI BENCHY

Peringkat Spesifik domain

Lihat model AI mana yang paling baik di Spesifik domain, mana yang tetap andal, dan di mana kesenjangan terbesar muncul.

Model yang ditampilkan

15

Rata-rata Skor Spesifik domain

4.8

Peringkat Model Perusahaan Skor Spesifik domain Skor Tes benar Waktu respons (rata-rata)
#121 Owl Alpha none Openrouter 5.3 5.5 1/3 3.00s
#123 MiMo-V2.5-Pro none Xiaomi 5.3 5.5 1/3 877ms
#128 Qwen3.6 Flash none Qwen 5.3 5.4 1/3 1.11s
#131 Qwen3.5-122B-A10B none Qwen 5.3 5.3 1/3 465ms
#134 GLM 5 Turbo none Z.ai 5.3 5.2 1/3 1.97s
#135 Kimi K2.5 none Moonshot AI 5.3 5.2 1/3 4.38s
#139 DeepSeek V4 Flash none DeepSeek 5.3 5.0 1/3 19.7s
#140 Qwen3 Coder Next none Qwen 5.3 4.9 1/3 962ms
#142 Mistral Small 4 none Mistral 5.3 4.9 1/3 367ms
#146 Laguna Xs.2 none Poolside 5.3 4.8 1/3 371ms
#150 Qwen3 Coder Next medium Qwen 5.3 4.6 1/3 638ms
#151 Trinity Large Preview none Arcee AI 5.3 4.6 1/3 877ms
#9 GPT-5.5 medium OpenAI 5.3 8.8 1/3 164.1s
#16 Gemini 3 Flash Preview low Google 5.3 8.4 1/3 8.05s
#21 GPT-5.4 medium OpenAI 5.3 8.0 1/3 74.3s

Model teratas menurut Skor Spesifik domain

Skor Spesifik domain vs total biaya

Model teratas menurut Waktu respons (rata-rata)