AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Failures

No answer Failures

See which AI models run into No answer most often, so you can spot reliability risks before choosing one. Sort by: Score ↑.

Models Shown

7

Total Failures

8

Most Affected Model

GLM 4.7 Flash 2
Rank Model Company No answer Count Score Tests Correct Response Time (avg)
#93 GLM 4.7 Flash medium Z.ai 2 4.6 4/18 32.3s
#52 Grok 4.1 Fast medium X AI 1 6.7 9/18 23.9s
#46 Kimi K2.5 medium Moonshot AI 1 7.0 9/18 72.4s
#43 Qwen3.5-35B-A3B medium Qwen 1 7.4 10/18 44.5s
#40 GPT-5.2 medium OpenAI 1 7.5 11/18 14.0s
#35 MiMo-V2-Omni medium Xiaomi 1 7.7 11/18 16.8s
#13 GLM 5 medium Z.ai 1 8.4 13/18 23.3s

Top Models by No answer Count

No answer Count vs Score

Top Models by Response Time (avg)