AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

Categorie AI BENCHY

Clasament Programare

Vezi ce modele AI se descurcă cel mai bine la Programare, care rămân fiabile și unde apar cele mai mari diferențe. Sortează după: Timp de răspuns (mediu) ↓.

Modele afișate

15

Media pentru Scor Programare

6.1

Cel mai bun model

Gemma 4 26B A4B 2.9
Rang Model Companie Scor Programare Scor Teste corecte Timp de răspuns (mediu)
#105 Grok 4.20 Beta none X AI 5.5 5.8 0/1 1.14s
#85 Gemini 3.1 Flash Lite none Google 6.8 6.6 1/2 1.13s
#141 GPT-5.4 Nano none OpenAI 5.4 4.8 0/2 1.09s
#52 Gemini 3.1 Flash Lite Preview none Google 6.8 7.5 1/2 1.06s
#135 Mistral Small 4 none Mistral 4.0 5.0 0/2 1.03s
#137 GPT-5.4 Mini none OpenAI 6.8 4.9 1/2 1.01s
#98 Qwen3.5-Flash none Qwen 6.8 5.9 1/2 993ms
#78 Gemini 3.1 Flash Lite minimal Google 6.8 6.7 1/2 951ms
#146 Mercury 2 none Inception 3.5 4.6 0/2 831ms
#90 Gemini 2.5 Flash none Google 6.8 6.4 1/2 810ms
#153 Granite 4.1 8B none IBM Granite 5.2 4.1 0/2 706ms
#17 Qwen3.6 Plus Preview medium Qwen 0.0 8.2 0/0 0ms
#20 Gemini 3 PRO Preview medium Google 3.0 8.1 0/2 0ms
#34 Step 3.5 Flash none Stepfun 3.0 7.8 0/1 0ms
#76 Hunter Alpha medium OpenRouter 3.0 6.7 0/1 0ms

Top modele după Scor Programare

Scor Programare vs cost total

Top modele după Timp de răspuns (mediu)