AI BENCHY

General Intelligence: Timed out

See which AI models most often hit the "Timed out" failure on General Intelligence tests, so you can spot weak points faster. Rows are sorted by average response time, highest first.

Models Shown: 4
Total Failures: 4
Most Affected Model: Qwen3.5-9B (1 timeout)

Rank  Model                           Company  Timed out Count  Category Score  Tests Correct  Response Time (avg)
#97   Qwen3.5-9B medium               Qwen     1                2.8             0/1            226.4s
#8    Qwen3.5 Plus 2026-02-15 medium  Qwen     1                4.7             0/1            79.9s
#19   Qwen3.5-122B-A10B medium        Qwen     1                3.4             0/1            34.1s
#43   Qwen3.5-35B-A3B medium          Qwen     1                2.8             0/1            30.3s

Charts (not reproduced here): Top Models by Timed out Count; Timed out Count vs Score; Top Models by Response Time (avg); Top Models by Estimated Wasted Cost.