AI BENCHY
AI BENCHY Failures

Timed out Failures

See which AI models time out most often, so you can spot reliability risks before choosing one.

Models Shown: 4
Total Failures: 73
Most Affected Model: Qwen3.5-9B (11)

Rank | Model | Company | Timed out Count | Score | Tests Correct | Response Time (avg)
#94 | GPT-5 Nano medium | OpenAI | 1 | 6.3 | 9/21 | 42.5s
#102 | Gemma 4 26B A4B none | Google | 1 | 6.0 | 8/21 | 5.91s
#105 | Nemotron 3 Super medium | NVIDIA | 1 | 5.8 | 8/21 | 32.0s
#150 | Qwen3 Coder Next medium | Qwen | 1 | 4.6 | 4/21 | 8.58s

[Charts: Top Models by Timed out Count; Timed out Count vs Score; Top Models by Response Time (avg)]