No answer Failure Ranking

See which AI models run into No answer most often, so you can spot reliability risks before choosing one.

Models Shown

Total Failures

Most Affected Model

Categories

In category Combined29 In category Coding18 In category Trivia12 In category Domain specific8 In category Data parsing and extraction5 In category Anti-AI Tricks4 In category Puzzle Solving3 In category Instructions following2 In category Tool Calling2

64/64

Rank	Model	Company	No answer Count	Score	Total Cost	Tests Correct	Response Time (avg)
#85	Step 3.7 Flash high	Stepfun	4	6.9	$1.207	11/22	64.7s
Total Tests 22 Wrong Tests 11 Total Cost $1.207 Response Time (avg) 64.7s
#96	Qwen3.6 27B medium	Qwen	3	6.5	$0.779	10/22	106.3s
Total Tests 22 Wrong Tests 12 Total Cost $0.779 Response Time (avg) 106.3s
#190	GLM 4.7 Flash medium	Z.ai	3	4.3	$0.166	4/22	142.6s
Total Tests 22 Wrong Tests 18 Total Cost $0.166 Response Time (avg) 142.6s
#12	Grok 4.5 high	X AI	2	8.9	$1.707	17/22	76.5s
Total Tests 22 Wrong Tests 5 Total Cost $1.707 Response Time (avg) 76.5s
#17	Claude Fable 5 medium	Anthropic	2	8.6	$3.478	17/22	17.2s
Total Tests 22 Wrong Tests 5 Total Cost $3.478 Response Time (avg) 17.2s
#38	GLM 5.2 medium	Z.ai	2	7.8	$0.068	15/21	23.3s
Total Tests 21 Wrong Tests 6 Total Cost $0.068 Response Time (avg) 23.3s
#76	Kimi K2.5 medium	Moonshot AI	2	7.0	$0.600	10/22	99.0s
Total Tests 22 Wrong Tests 12 Total Cost $0.600 Response Time (avg) 99.0s
#93	Gemma 4 26B A4B medium	Google	2	6.6	$0.082	14/22	103.8s
Total Tests 22 Wrong Tests 8 Total Cost $0.082 Response Time (avg) 103.8s
#108	Claude Sonnet 5 none	Anthropic	2	6.3	$0.548	8/22	6.04s
Total Tests 22 Wrong Tests 14 Total Cost $0.548 Response Time (avg) 6.04s
#115	Qwen3.5-35B-A3B medium	Qwen	2	6.2	$0.837	11/22	112.5s
Total Tests 22 Wrong Tests 11 Total Cost $0.837 Response Time (avg) 112.5s
#130	Mimo V2 Omni medium	Xiaomi	2	5.9	$0.683	10/21	41.2s
Total Tests 21 Wrong Tests 11 Total Cost $0.683 Response Time (avg) 41.2s
#168	MiniMax M2.7 medium	Minimax	2	5.0	$0.163	5/22	41.3s
Total Tests 22 Wrong Tests 17 Total Cost $0.163 Response Time (avg) 41.3s
#186	MiniMax M2.5 medium	Minimax	2	4.6	$0.340	5/22	68.3s
Total Tests 22 Wrong Tests 17 Total Cost $0.340 Response Time (avg) 68.3s
#194	Laguna Xs.2 medium	Poolside	2	4.1	$0.015	6/19	6.73s
Total Tests 19 Wrong Tests 13 Total Cost $0.015 Response Time (avg) 6.73s
#200	Qwen3.5-9B medium	Qwen	2	3.8	$0.036	3/22	82.2s
Total Tests 22 Wrong Tests 19 Total Cost $0.036 Response Time (avg) 82.2s

1 2 3 4 5

→

No answer Failures

Filter models

Top Models by No answer Count

No answer Count vs Score

Top Models by Response Time (avg)