AI BENCHY

AI BENCHY Category

Programming Leaderboard

See which AI models perform best at Programming, which remain reliable, and where the biggest gaps appear. Sorted by: Correct tests ↓.

Models shown: 15

Average Programming Score: 6.1

Best model: Gemini 3.5 Flash (10.0)

| Rank | Model | Company | Programming Score | Score | Correct tests | Response time (average) |
|------|-------|---------|-------------------|-------|---------------|-------------------------|
| #25 | Gemini 3.5 Flash minimal | Google | 7.0 | 7.9 | 1/2 | 3.39s |
| #26 | Qwen3.5-27B medium | Qwen | 7.0 | 7.9 | 1/2 | 123.9s |
| #27 | Qwen3.7 Max none | Qwen | 6.8 | 7.9 | 1/2 | 1.39s |
| #28 | GPT-5.4 medium | OpenAI | 8.2 | 7.9 | 1/2 | 55.0s |
| #29 | GLM 5 Turbo medium | Z.ai | 7.3 | 7.9 | 1/2 | 53.9s |
| #30 | GPT-5.2 Chat none | OpenAI | 8.2 | 7.9 | 1/2 | 8.05s |
| #32 | Qwen3.6 35B A3B medium | Qwen | 6.6 | 7.8 | 1/2 | 59.3s |
| #33 | Grok 4.3 medium | X AI | 7.4 | 7.8 | 1/2 | 55.3s |
| #36 | Gemini 3.1 Flash Lite Preview medium | Google | 6.8 | 7.7 | 1/2 | 3.98s |
| #37 | Gemini 3.1 Flash Lite medium | Google | 6.8 | 7.7 | 1/2 | 3.59s |
| #38 | Gemini 2.5 Flash medium | Google | 6.6 | 7.7 | 1/2 | 54.6s |
| #41 | Gemini 3 Flash Preview none | Google | 6.8 | 7.7 | 1/2 | 2.19s |
| #42 | Grok Build 0.1 medium | X AI | 7.0 | 7.7 | 1/2 | 62.6s |
| #44 | DeepSeek V4 Flash high | DeepSeek | 6.8 | 7.6 | 1/2 | 58.1s |
| #45 | MiMo-V2.5-Pro medium | Xiaomi | 7.0 | 7.6 | 1/2 | 81.7s |

Top models by Programming Score

Programming Score vs. total cost

Top models by response time (average)