AI BENCHY

AI BENCHY Failures

No answer Failures

See which AI models most often return no answer, so you can spot reliability risks before choosing one. The table below is sorted by Tests Correct, ascending.

Models Shown: 7

Total Failures: 8

Most Affected Model: GLM 4.7 Flash (No answer count: 2)

Rank  Model                   Company      No answer Count  Score  Tests Correct  Response Time (avg)
----  ----------------------  -----------  ---------------  -----  -------------  -------------------
#93   GLM 4.7 Flash medium    Z.ai         2                4.6    4/18           32.3s
#46   Kimi K2.5 medium        Moonshot AI  1                7.0    9/18           72.4s
#52   Grok 4.1 Fast medium    X AI         1                6.7    9/18           23.9s
#43   Qwen3.5-35B-A3B medium  Qwen         1                7.4    10/18          44.5s
#35   MiMo-V2-Omni medium     Xiaomi       1                7.7    11/18          16.8s
#40   GPT-5.2 medium          OpenAI       1                7.5    11/18          14.0s
#13   GLM 5 medium            Z.ai         1                8.4    13/18          23.3s
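
The summary figures (Models Shown, Total Failures, Most Affected Model) can be recomputed from the table rows. The following is a minimal sketch, assuming the rows are copied by hand into a Python list; the names rows, summarize, and is_sorted_by_tests_correct are hypothetical and not part of AI BENCHY.

```python
# Rows copied from the table: (model, no_answer_count, tests_correct, tests_total).
# The tuple layout is an assumption made for this illustration.
rows = [
    ("GLM 4.7 Flash medium", 2, 4, 18),
    ("Kimi K2.5 medium", 1, 9, 18),
    ("Grok 4.1 Fast medium", 1, 9, 18),
    ("Qwen3.5-35B-A3B medium", 1, 10, 18),
    ("MiMo-V2-Omni medium", 1, 11, 18),
    ("GPT-5.2 medium", 1, 11, 18),
    ("GLM 5 medium", 1, 13, 18),
]


def summarize(rows):
    """Return (models shown, total failures, most affected model, its count)."""
    models_shown = len(rows)
    total_failures = sum(count for _, count, _, _ in rows)
    # max() returns the first row with the highest No answer count.
    name, count, _, _ = max(rows, key=lambda r: r[1])
    return models_shown, total_failures, name, count


def is_sorted_by_tests_correct(rows):
    """Check that rows are in ascending order of tests correct."""
    correct = [r[2] for r in rows]
    return correct == sorted(correct)


if __name__ == "__main__":
    print(summarize(rows))
    print(is_sorted_by_tests_correct(rows))
```

Run against the seven rows listed, this gives 7 models, 8 total failures, and GLM 4.7 Flash medium as the most affected model with a count of 2, matching the summary figures on the page.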

Charts (not reproduced here): Top Models by No answer Count; No answer Count vs Score; Top Models by Response Time (avg).