AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

Kategori AI BENCHY

Peringkat Pemrograman

Lihat model AI mana yang paling baik di Pemrograman, mana yang tetap andal, dan di mana kesenjangan terbesar muncul.

Model yang ditampilkan

15

Rata-rata Skor Pemrograman

6.1

Peringkat Model Perusahaan Skor Pemrograman Skor Tes benar Waktu respons (rata-rata)
#126 Nemotron 3 Nano Omni 30b A3b Reasoning medium NVIDIA 3.3 5.4 0/1 38.1s
#139 GPT-4o-mini none OpenAI 3.2 4.9 0/2 2.05s
#109 DeepSeek V3.2 none DeepSeek 3.1 5.7 0/2 20.9s
#96 Nemotron 3 Super medium NVIDIA 3.1 5.9 0/2 62.4s
#20 Gemini 3 PRO Preview medium Google 3.0 8.1 0/2 0ms
#34 Step 3.5 Flash none Stepfun 3.0 7.8 0/1 0ms
#76 Hunter Alpha medium OpenRouter 3.0 6.7 0/1 0ms
#112 Hunter Alpha none OpenRouter 3.0 5.7 0/1 0ms
#58 Step 3.5 Flash medium Stepfun 3.0 7.4 0/1 62.8s
#31 Gemma 4 26B A4B medium Google 2.9 7.8 0/2 258.4s
#151 Qwen3.5-9B medium Qwen 2.8 4.2 0/2 135.6s
#83 DeepSeek V4 Pro high DeepSeek 2.8 6.6 0/2 51.8s
#129 Laguna Xs.2 none Poolside 2.5 5.3 0/1 1.96s
#88 Grok 4.1 Fast medium X AI 2.3 6.5 0/1 23.6s
#147 Hy3 preview none Tencent 2.3 4.6 0/1 4.56s

Model teratas menurut Skor Pemrograman

Skor Pemrograman vs total biaya

Model teratas menurut Waktu respons (rata-rata)