API error Failure Ranking

See which AI models run into API errors most often, so you can spot reliability risks before choosing one.

Models Shown

Total Failures: 161

Most Affected Model

Categories

Coding: 45
Combined: 26
Tool Calling: 17
Anti-AI Tricks: 14
Data parsing and extraction: 14
Trivia: 13
General Intelligence: 12
Puzzle Solving: 12
Domain specific: 7
Instructions following: 1

68/68

Rank	Model	Company	API error Count	Score	Total Cost	Tests Correct	Response Time (avg)
#209	Step 3.5 Flash none	Stepfun	4	2.3	$0.020	6/12	39.0s
#210	LFM2-24B-A2B none	Liquid	4	2.2	$0.001	2/16	782ms
#100	Hy3 preview medium	Tencent	3	6.5	$0.018	14/21	16.3s
#144	KAT-Coder-Air V2.5 high	Kwaipilot	3	5.6	$0.077	7/22	15.9s
#162	Ling-2.6-1T none	Inclusionai	3	5.3	$0.016	4/22	8.58s
#193	Elephant Alpha none	Openrouter	3	4.3	$0.000	5/21	1.22s
#195	Elephant Alpha medium	Openrouter	3	4.3	$0.000	6/21	1.27s
#202	Grok Build 0.1 none	X AI	3	4.0	$0.547	7/19	28.7s
#206	gpt-oss-120b none	OpenAI	3	3.7	$0.010	6/19	21.6s
#33	Kimi K3 max	Moonshot AI	2	8.0	$3.112	16/22	122.5s
#76	DeepSeek V3.2 medium	DeepSeek	2	7.0	$0.078	11/22	68.6s
#90	Qwen3.6 35B A3B medium	Qwen	2	6.7	$0.746	13/22	58.1s
#108	Ring-2.6-1T medium	Inclusionai	2	6.3	$0.103	11/22	68.7s
#110	Gemma 4 31B medium	Google	2	6.3	$0.163	14/22	75.4s
#115	Gemma 4 31B none	Google	2	6.2	$0.035	10/22	5.34s
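The raw API error count does not account for how many tests each model ran, so a per-test rate can make reliability easier to compare. The sketch below is only an illustration: `api_error_rate` and `rank_by_error_rate` are hypothetical helpers, not part of the leaderboard. The rate is an assumed metric (API errors divided by total tests), and the rows are a small sample copied from the table above.

```python
# Sample rows copied from the table: (model, api_error_count, tests_correct, total_tests).
# total_tests is the denominator of the "Tests Correct" column.
rows = [
    ("Step 3.5 Flash none", 4, 6, 12),
    ("LFM2-24B-A2B none", 4, 2, 16),
    ("Hy3 preview medium", 3, 14, 21),
    ("Kimi K3 max", 2, 16, 22),
]


def api_error_rate(api_errors: int, total_tests: int) -> float:
    """Share of a model's tests that ended in an API error (assumed metric)."""
    if total_tests <= 0:
        raise ValueError("total_tests must be positive")
    return api_errors / total_tests


def rank_by_error_rate(rows):
    """Sort rows from the highest to the lowest API error rate."""
    return sorted(rows, key=lambda r: api_error_rate(r[1], r[3]), reverse=True)


for model, errors, correct, total in rank_by_error_rate(rows):
    rate = api_error_rate(errors, total)
    print(f"{model}: {errors}/{total} tests hit an API error ({rate:.1%})")
```

Under this assumed metric, two models with the same error count can rank differently: Step 3.5 Flash and LFM2-24B-A2B both show 4 API errors, but over 12 and 16 tests respectively.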


API error Failures


[Chart: Top Models by API error Count]

[Chart: API error Count vs Score]

[Chart: Top Models by Response Time (avg)]