AI BENCHY Category
Coding Ranking
See which AI models perform best on Coding, which ones stay reliable, and where the biggest gaps appear. Sort by: Tests Correct ↓.
| Rank | Model | Company | Coding Score | Score | Tests Correct | Response Time (avg) |
|---|---|---|---|---|---|---|
| #153 | Granite 4.1 8B none | IBM Granite | 5.2 | 4.1 | 0/2 | 706ms |