No answer Failure Ranking

See which AI models run into No answer most often, so you can spot reliability risks before choosing one. Sort by: Response Time (avg) ↑.

Models Shown

Total Failures

Most Affected Model

Gemini 3.1 Flash Lite Preview 1

Categories

In category Combined29 In category Coding18 In category Trivia13 In category Data parsing and extraction8 In category Domain specific8 In category Anti-AI Tricks4 In category Puzzle Solving3 In category Instructions following2 In category Tool Calling2

67/67

Rank	Model	Company	No answer Count	Score	Total Cost	Tests Correct	Response Time (avg)
#106	Gemini 3.1 Flash Lite Preview none	Google	1	6.4	$0.052	12/22	1.58s
Total Tests 22 Wrong Tests 10 Total Cost $0.052 Response Time (avg) 1.58s
#132	GPT-5.6 Terra none	OpenAI	1	6.0	$0.349	8/22	1.65s
Total Tests 22 Wrong Tests 14 Total Cost $0.349 Response Time (avg) 1.65s
#122	Gemini 3.1 Flash Lite none	Google	1	6.1	$0.046	9/22	1.75s
Total Tests 22 Wrong Tests 13 Total Cost $0.046 Response Time (avg) 1.75s
#120	Gemini 3.1 Flash Lite minimal	Google	1	6.1	$0.047	10/22	1.86s
Total Tests 22 Wrong Tests 12 Total Cost $0.047 Response Time (avg) 1.86s
#174	GPT-4o-mini none	OpenAI	1	5.0	$0.010	5/22	1.99s
Total Tests 22 Wrong Tests 17 Total Cost $0.010 Response Time (avg) 1.99s
#180	GPT-5.4 Nano none	OpenAI	1	4.8	$0.041	4/22	2.57s
Total Tests 22 Wrong Tests 18 Total Cost $0.041 Response Time (avg) 2.57s
#89	Gemini 3 Flash Preview none	Google	1	6.8	$0.085	13/22	2.95s
Total Tests 22 Wrong Tests 9 Total Cost $0.085 Response Time (avg) 2.95s
#154	MiMo-V2.5-Pro none	Xiaomi	1	5.5	$0.068	6/22	4.12s
Total Tests 22 Wrong Tests 16 Total Cost $0.068 Response Time (avg) 4.12s
#116	Seed-2.0-Lite none	Bytedance Seed	1	6.2	$0.066	8/22	4.40s
Total Tests 22 Wrong Tests 14 Total Cost $0.066 Response Time (avg) 4.40s
#168	MiMo-V2.5 none	Xiaomi	1	5.1	$0.025	5/22	4.62s
Total Tests 22 Wrong Tests 17 Total Cost $0.025 Response Time (avg) 4.62s
#66	Claude Opus 4.8 none	Anthropic	1	7.3	$1.166	13/22	4.91s
Total Tests 22 Wrong Tests 9 Total Cost $1.166 Response Time (avg) 4.91s
#161	Qwen3.6 35B A3B none	Qwen	1	5.3	$0.061	4/22	5.52s
Total Tests 22 Wrong Tests 18 Total Cost $0.061 Response Time (avg) 5.52s
#112	Claude Sonnet 5 none	Anthropic	2	6.3	$0.548	8/22	6.04s
Total Tests 22 Wrong Tests 14 Total Cost $0.548 Response Time (avg) 6.04s
#151	GLM 5.1 none	Z.ai	1	5.5	$0.164	7/22	6.70s
Total Tests 22 Wrong Tests 15 Total Cost $0.164 Response Time (avg) 6.70s
#198	Laguna Xs.2 medium	Poolside	2	4.1	$0.015	6/19	6.73s
Total Tests 19 Wrong Tests 13 Total Cost $0.015 Response Time (avg) 6.73s

1 2 3 4 5

→

No answer Failures

Filter models

Top Models by No answer Count

No answer Count vs Score

Top Models by Response Time (avg)