AI BENCHY Category Failures
Puzzle Solving
Timed out
Puzzle Solving
Timed out
See which AI models are most likely to hit Timed out on Puzzle Solving, so you can spot weak points faster. Sort by: Tests Correct ↓.
Related Failure Reasons
Related Categories
| Rank | Model | Company | Timed out Count | Category Score | Tests Correct | Response Time (avg) |
|---|---|---|---|---|---|---|
| #24 | Qwen3.5-Flash medium | Qwen | 2 | 4.0 | 1/3 | 56.7s |
| #35 | Qwen3.5-35B-A3B medium | Qwen | 1 | 4.0 | 1/3 | 31.6s |
| #43 | MiniMax M2.5 medium | Minimax | 1 | 4.0 | 1/3 | 11.5s |