AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

Kategori AI BENCHY

Peringkat Gabungan

Lihat model AI mana yang paling baik di Gabungan, mana yang tetap andal, dan di mana kesenjangan terbesar muncul. Urutkan berdasarkan: Metrik ↑.

Model yang ditampilkan

15

Rata-rata Skor Gabungan

6.3

Peringkat Model Perusahaan Skor Gabungan Skor Tes benar Waktu respons (rata-rata)
#124 Kimi K2.6 none Moonshot AI 3.0 5.5 0/1 3.38s
#125 GPT-5.4 none OpenAI 3.0 5.5 0/1 2.89s
#126 gpt-oss-120b none OpenAI 3.0 5.4 0/1 0ms
#127 Grok 4.20 none X AI 3.0 5.4 0/1 6.04s
#128 Qwen3.6 Flash none Qwen 3.0 5.4 0/1 4.22s
#131 Qwen3.5-122B-A10B none Qwen 3.0 5.3 0/1 46.0s
#132 Mistral Small 4 medium Mistral 3.0 5.3 0/1 25.3s
#134 GLM 5 Turbo none Z.ai 3.0 5.2 0/1 4.89s
#136 Elephant Alpha medium Openrouter 3.0 5.1 0/1 3.70s
#137 Elephant Alpha none Openrouter 3.0 5.1 0/1 3.81s
#138 Ling-2.6-flash none Inclusionai 3.0 5.0 0/1 35.3s
#140 Qwen3 Coder Next none Qwen 3.0 4.9 0/1 45.1s
#141 Nemotron 3 Super none NVIDIA 3.0 4.9 0/1 16.4s
#142 Mistral Small 4 none Mistral 3.0 4.9 0/1 1.72s
#143 MiMo-V2.5 none Xiaomi 3.0 4.9 0/1 2.36s

Model teratas menurut Skor Gabungan

Skor Gabungan vs total biaya

Model teratas menurut Waktu respons (rata-rata)