No answer Failure Ranking

See which AI models most often return no answer, so you can spot reliability risks before choosing one.


Categories

Combined: 29
Coding: 18
Trivia: 13
Data parsing and extraction: 8
Domain specific: 8
Anti-AI Tricks: 4
Puzzle Solving: 3
Instructions following: 2
Tool Calling: 2

Models shown: 67/67

Rank	Model	Company	No answer Count	Score	Total Cost	Tests Correct	Wrong Tests	Response Time (avg)
#204	Qwen3.5-9B medium	Qwen	2	3.8	$0.036	3/22	19	82.2s
#14	Claude Opus 4.8 medium	Anthropic	1	8.8	$1.931	18/22	4	12.5s
#21	GPT-5.2 medium	OpenAI	1	8.4	$0.951	14/22	8	22.6s
#26	GPT-5 Mini medium	OpenAI	1	8.1	$0.237	12/22	10	27.6s
#27	Muse Spark 1.1 high	Meta	1	8.1	$1.694	12/22	10	31.5s
#29	Step 3.7 Flash medium	Stepfun	1	8.0	$0.515	14/22	8	26.4s
#30	GPT-5.2 Chat none	OpenAI	1	8.0	$0.604	14/22	8	7.65s
#31	GLM 5.2 high	Z.ai	1	8.0	$0.970	14/22	8	62.7s
#32	Inkling medium	Thinkingmachines	1	8.0	$0.391	15/22	7	16.2s
#33	Kimi K3 max	Moonshot AI	1	8.0	$3.112	16/22	6	122.5s
#35	Seed-2.0-Lite medium	Bytedance Seed	1	7.9	$0.234	14/22	8	48.5s
#41	Claude Opus 4.8 low	Anthropic	1	7.8	$2.077	16/22	6	12.7s
#42	GLM 5 medium	Z.ai	1	7.7	$0.307	15/21	6	33.5s
#46	DeepSeek V4 Pro high	DeepSeek	1	7.7	$0.200	10/22	12	79.1s
#47	MiniMax M3 medium	Minimax	1	7.6	$0.286	12/22	10	75.0s


No answer Failures

Charts (not reproduced here): Top Models by No answer Count; No answer Count vs Score; Top Models by Response Time (avg)