AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

Kategori AI BENCHY

Peringkat Pemrograman

Lihat model AI mana yang paling baik di Pemrograman, mana yang tetap andal, dan di mana kesenjangan terbesar muncul.

Model yang ditampilkan

15

Rata-rata Skor Pemrograman

6.1

Peringkat Model Perusahaan Skor Pemrograman Skor Tes benar Waktu respons (rata-rata)
#137 GPT-5.4 Mini none OpenAI 6.8 4.9 1/2 1.01s
#138 Qwen3.6 35B A3B none Qwen 6.8 4.9 1/2 12.3s
#123 MiniMax M2.7 medium Minimax 6.7 5.4 1/2 54.7s
#38 Gemini 2.5 Flash medium Google 6.6 7.7 1/2 54.6s
#101 Owl Alpha medium Openrouter 6.6 5.8 1/2 19.1s
#117 Qwen3.6 Flash none Qwen 6.6 5.5 1/2 2.34s
#32 Qwen3.6 35B A3B medium Qwen 6.6 7.8 1/2 59.3s
#81 Qwen3.6 27B medium Qwen 6.6 6.6 1/2 165.4s
#57 Kimi K2.6 medium Moonshot AI 6.5 7.4 1/2 118.2s
#63 Qwen3.5-35B-A3B medium Qwen 6.5 7.3 1/2 244.5s
#82 Laguna Xs.2 medium Poolside 6.3 6.6 0/1 14.4s
#105 Grok 4.20 Beta none X AI 5.5 5.8 0/1 1.14s
#148 Ling-2.6-1T none Inclusionai 5.5 4.5 0/1 10.6s
#94 GPT-5 Nano medium OpenAI 5.4 6.1 0/2 47.8s
#95 DeepSeek V4 Pro none DeepSeek 5.4 6.0 0/2 8.27s

Model teratas menurut Skor Pemrograman

Skor Pemrograman vs total biaya

Model teratas menurut Waktu respons (rata-rata)