AI BENCHY Category Failures
Puzzle Solving
Timed out
Puzzle Solving
Timed out
See which AI models are most likely to hit Timed out on Puzzle Solving, so you can spot weak points faster. Sort by: Failure Count ↑.
Related Failure Reasons
Related Categories
| Rank | Model | Company | Timed out Count | Category Score | Tests Correct | Response Time (avg) |
|---|---|---|---|---|---|---|
| #35 | Qwen3.5-35B-A3B medium | Qwen | 1 | 4.0 | 1/3 | 31.6s |
| #43 | MiniMax M2.5 medium | Minimax | 1 | 4.0 | 1/3 | 11.5s |
| #24 | Qwen3.5-Flash medium | Qwen | 2 | 4.0 | 1/3 | 56.7s |