AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Category Failures

General Intelligence: Timed out

General Intelligence
Timed out

See which AI models are most likely to hit Timed out on General Intelligence, so you can spot weak points faster. Sort by: Failure Count ↑.

Models Shown

4

Total Failures

4

Most Affected Model

Qwen3.5 Plus 2026-02-15 1
Rank Model Company Timed out Count Category Score Tests Correct Response Time (avg)
#8 Qwen3.5 Plus 2026-02-15 medium Qwen 1 4.7 0/1 79.9s
#19 Qwen3.5-122B-A10B medium Qwen 1 3.4 0/1 34.1s
#43 Qwen3.5-35B-A3B medium Qwen 1 2.8 0/1 30.3s
#97 Qwen3.5-9B medium Qwen 1 2.8 0/1 226.4s

Top Models by Timed out Count

Timed out Count vs Score

Top Models by Response Time (avg)

Top Models by Estimated Wasted Cost