AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

Categorie AI BENCHY

Clasament Programare

Vezi ce modele AI se descurcă cel mai bine la Programare, care rămân fiabile și unde apar cele mai mari diferențe.

Modele afișate

15

Media pentru Scor Programare

6.1

Cel mai bun model

Gemini 3.5 Flash 10.0
Rang Model Companie Scor Programare Scor Teste corecte Timp de răspuns (mediu)
#126 Nemotron 3 Nano Omni 30b A3b Reasoning medium NVIDIA 3.3 5.4 0/1 38.1s
#139 GPT-4o-mini none OpenAI 3.2 4.9 0/2 2.05s
#109 DeepSeek V3.2 none DeepSeek 3.1 5.7 0/2 20.9s
#96 Nemotron 3 Super medium NVIDIA 3.1 5.9 0/2 62.4s
#20 Gemini 3 PRO Preview medium Google 3.0 8.1 0/2 0ms
#34 Step 3.5 Flash none Stepfun 3.0 7.8 0/1 0ms
#76 Hunter Alpha medium OpenRouter 3.0 6.7 0/1 0ms
#112 Hunter Alpha none OpenRouter 3.0 5.7 0/1 0ms
#58 Step 3.5 Flash medium Stepfun 3.0 7.4 0/1 62.8s
#31 Gemma 4 26B A4B medium Google 2.9 7.8 0/2 258.4s
#151 Qwen3.5-9B medium Qwen 2.8 4.2 0/2 135.6s
#83 DeepSeek V4 Pro high DeepSeek 2.8 6.6 0/2 51.8s
#129 Laguna Xs.2 none Poolside 2.5 5.3 0/1 1.96s
#88 Grok 4.1 Fast medium X AI 2.3 6.5 0/1 23.6s
#147 Hy3 preview none Tencent 2.3 4.6 0/1 4.56s

Top modele după Scor Programare

Scor Programare vs cost total

Top modele după Timp de răspuns (mediu)