Tool Calling x No answer Ranking

See which AI models are most likely to hit No answer on Tool Calling, so you can spot weak points faster. Sort by: Tests Correct ↑.

Models Shown

Total Failures

Most Affected Model

Failure Reasons

API error17 Invalid tool call9 Did not follow instructions8 Wrong answer3 No answer2

Categories

Combined29 Coding18 Trivia13 Data parsing and extraction8 Domain specific8 Anti-AI Tricks4 Puzzle Solving3 Instructions following2 Tool Calling2

2/2

Rank	Model	Company	No answer Count	Category Score	Total Cost	Tests Correct	Response Time (avg)
#21	GPT-5.2 medium	OpenAI	1	4.7	$0.951	0/1	10.3s
Total Tests 1 Wrong Tests 1 Total Cost $0.951 Response Time (avg) 10.3s
#185	Grok 4.1 Fast medium	X AI	1	2.8	$0.069	0/1	27.7s
Total Tests 1 Wrong Tests 1 Total Cost $0.069 Response Time (avg) 27.7s

Filter models