AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

Categorie AI BENCHY

Clasament Programare

Vezi ce modele AI se descurcă cel mai bine la Programare, care rămân fiabile și unde apar cele mai mari diferențe. Sortează după: Metrică ↑.

Modele afișate

15

Media pentru Scor Programare

6.1

Cel mai bun model

Qwen3.6 Plus Preview 0.0
Rang Model Companie Scor Programare Scor Teste corecte Timp de răspuns (mediu)
#50 Claude Sonnet 4.6 medium Anthropic 6.9 7.6 1/2 33.9s
#55 GPT-5.3 Chat none OpenAI 6.9 7.4 1/2 10.5s
#56 MiMo-V2.5 medium Xiaomi 6.9 7.4 1/2 64.5s
#4 Gemini 3.1 Pro Preview medium Google 7.0 9.3 1/2 54.3s
#25 Gemini 3.5 Flash minimal Google 7.0 7.9 1/2 3.39s
#26 Qwen3.5-27B medium Qwen 7.0 7.9 1/2 123.9s
#42 Grok Build 0.1 medium X AI 7.0 7.7 1/2 62.6s
#111 Owl Alpha none Openrouter 7.0 5.7 1/2 39.7s
#45 MiMo-V2.5-Pro medium Xiaomi 7.0 7.6 1/2 81.7s
#23 Seed-2.0-Lite medium Bytedance Seed 7.0 8.1 1/2 107.7s
#66 Claude Opus 4.6 medium Anthropic 7.2 7.2 1/2 29.4s
#87 Mercury 2 medium Inception 7.2 6.5 1/2 2.29s
#12 Gemini 3 Flash Preview low Google 7.3 8.6 1/2 6.66s
#106 Qwen3.5-27B none Qwen 7.3 5.8 1/2 1.98s
#29 GLM 5 Turbo medium Z.ai 7.3 7.9 1/2 53.9s

Top modele după Scor Programare

Scor Programare vs cost total

Top modele după Timp de răspuns (mediu)