AI BENCHY

AI BENCHY Category Failures

Coding: API error

See which AI models most often hit an API error on Coding tests, so you can spot weak points faster. Rows are sorted by Tests Correct, ascending.

Models Shown: 6
Total Failures: 6
Most Affected Model: Gemini 3 PRO Preview (1 API error)

Rank  Model                        Company     API error Count  Category Score  Tests Correct  Response Time (avg)
#10   Gemini 3 PRO Preview medium  Google      1                3.0             0/1            0ms
#18   Qwen3.6 Plus medium          Qwen        1                3.0             0/1            0ms
#47   Hunter Alpha medium          OpenRouter  1                3.0             0/1            0ms
#48   Nemotron 3 Super medium      NVIDIA      1                3.0             0/1            0ms
#68   Hunter Alpha none            OpenRouter  1                3.0             0/1            0ms
#93   Step 3.5 Flash none          Stepfun     1                3.0             0/1            0ms

Charts (not shown): Top Models by API error Count; API error Count vs Score; Top Models by Response Time (avg); Top Models by Estimated Wasted Cost.