AI BENCHY
Your ad here

Kategori AI BENCHY

Peringkat Spesifik domain

Lihat model AI mana yang paling baik di Spesifik domain, mana yang tetap andal, dan di mana kesenjangan terbesar muncul.

Model yang ditampilkan

15

Rata-rata Skor Spesifik domain

4.8

Peringkat Model Perusahaan Skor Spesifik domain Skor Tes benar Waktu respons (rata-rata)
#40 GPT-5.2 medium OpenAI 5.9 7.5 1/3 77.8s
#41 MiMo-V2-Flash medium Xiaomi 5.9 7.5 1/3 96.0s
#62 Gemini 2.5 Flash none Google 5.9 6.2 1/3 495ms
#95 Grok 4.1 Fast none X AI 5.9 4.5 1/3 1.06s
#98 LFM2-24B-A2B none Liquid 5.9 4.1 1/3 287ms
#52 Grok 4.1 Fast medium X AI 5.8 6.7 1/3 121.8s
#8 Qwen3.5 Plus 2026-02-15 medium Qwen 5.3 8.5 1/3 17.5s
#10 Qwen3.5-27B medium Qwen 5.3 8.4 1/3 79.5s
#11 Gemini 3.1 Flash Lite Preview high Google 5.3 8.4 1/3 127.6s
#12 Gemini 3 PRO Preview medium Google 5.3 8.4 1/3 7.01s
#22 Gemini 3.1 Flash Lite Preview low Google 5.3 8.1 1/3 2.36s
#23 MiMo-V2-Pro medium Xiaomi 5.3 8.1 1/3 6.00s
#25 Grok 4.20 Beta medium X AI 5.3 8.0 1/3 21.3s
#28 GPT-5.2 Chat none OpenAI 5.3 7.9 1/3 17.8s
#29 Gemini 3.1 Flash Lite Preview none Google 5.3 7.9 1/3 942ms

Model teratas menurut Skor Spesifik domain

Skor Spesifik domain vs total biaya

Model teratas menurut Waktu respons (rata-rata)