AI BENCHY Category Failures
Tool Calling
No answer
See which AI models are most likely to return no answer on Tool Calling tests, so you can spot weak points faster. The table is sorted by average response time, ascending.
| Rank | Model | Company | No answer Count | Category Score | Tests Correct | Response Time (avg) |
|---|---|---|---|---|---|---|
| #27 | GPT-5.2 medium | OpenAI | 1 | 10.0 | 0/1 | 10.3s |
| #30 | Grok 4.1 Fast medium | xAI | 1 | 10.0 | 0/1 | 27.7s |