AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Category Failures

Coding: No answer

Coding
No answer

See which AI models are most likely to hit No answer on Coding, so you can spot weak points faster. Sort by: Response Time (avg) ↑.

Models Shown

3

Total Failures

18

Most Affected Model

Gemini 3 PRO Preview 1
Rank Model Company No answer Count Category Score Tests Correct Response Time (avg)
#79 Kimi K2.5 medium Moonshot AI 1 4.1 0/2 215.9s
#70 Qwen3.5-35B-A3B medium Qwen 1 6.5 1/2 244.5s
#47 Gemma 4 26B A4B medium Google 1 2.9 0/2 258.4s

Top Models by No answer Count

No answer Count vs Score

Top Models by Response Time (avg)

Top Models by Estimated Wasted Cost