AI BENCHY Category Failures
Tool Calling: No answer
Tool Calling
No answer
See which AI models are most likely to hit No answer on Tool Calling, so you can spot weak points faster. Sort by: Tests Correct ↑.
Failure Reasons
| Rank | Model | Company | No answer Count | Category Score | Tests Correct | Response Time (avg) |
|---|---|---|---|---|---|---|
| #40 | GPT-5.2 medium | OpenAI | 1 | 4.7 | 0/1 | 10.3s |
| #52 | Grok 4.1 Fast medium | X AI | 1 | 2.8 | 0/1 | 27.7s |