AI BENCHY
AI BENCHY category

Domain-specific ranking

See which AI models perform best on domain-specific tasks, which remain reliable, and where the largest gaps appear. Sorted by: Tests correct ↓.

Models shown: 15

Average domain-specific score: 4.8

| Rank | Model | Company | Domain-specific score | Score | Tests correct | Response time (avg) |
|---|---|---|---|---|---|---|
| #123 | MiMo-V2.5-Pro none | Xiaomi | 5.3 | 5.5 | 1/3 | 877ms |
| #124 | Kimi K2.6 none | Moonshot AI | 5.3 | 5.5 | 1/3 | 1.48s |
| #125 | GPT-5.4 none | OpenAI | 5.3 | 5.5 | 1/3 | 1.07s |
| #128 | Qwen3.6 Flash none | Qwen | 5.3 | 5.4 | 1/3 | 1.11s |
| #131 | Qwen3.5-122B-A10B none | Qwen | 5.3 | 5.3 | 1/3 | 465ms |
| #132 | Mistral Small 4 medium | Mistral | 5.3 | 5.3 | 1/3 | 6.11s |
| #134 | GLM 5 Turbo none | Z.ai | 5.3 | 5.2 | 1/3 | 1.97s |
| #135 | Kimi K2.5 none | Moonshot AI | 5.3 | 5.2 | 1/3 | 4.38s |
| #139 | DeepSeek V4 Flash none | DeepSeek | 5.3 | 5.0 | 1/3 | 19.7s |
| #140 | Qwen3 Coder Next none | Qwen | 5.3 | 4.9 | 1/3 | 962ms |
| #142 | Mistral Small 4 none | Mistral | 5.3 | 4.9 | 1/3 | 367ms |
| #146 | Laguna Xs.2 none | Poolside | 5.3 | 4.8 | 1/3 | 371ms |
| #150 | Qwen3 Coder Next medium | Qwen | 5.3 | 4.6 | 1/3 | 638ms |
| #151 | Trinity Large Preview none | Arcee AI | 5.3 | 4.6 | 1/3 | 877ms |
| #152 | MiMo-V2-Flash none | Xiaomi | 5.3 | 4.6 | 1/3 | 564ms |

Top models by domain-specific score

Domain-specific score vs. total cost

Top models by response time (average)