API error Failure Ranking

See which AI models run into API error most often, so you can spot reliability risks before choosing one.

Models Shown

Total Failures

159

Most Affected Model

Categories

In category Coding44 In category Combined25 In category Tool Calling17 In category Anti-AI Tricks14 In category Data parsing and extraction14 In category Trivia13 In category General Intelligence12 In category Puzzle Solving12 In category Domain specific7 In category Instructions following1

66/66

Rank	Model	Company	API error Count	Score	Total Cost	Tests Correct	Response Time (avg)
#171	Qwen3.6 Plus Preview medium	Qwen	8	4.9	$0.000	9/19	15.2s
Total Tests 19 Wrong Tests 10 Total Cost $0.000 Response Time (avg) 15.2s
#131	Hy3 preview high	Tencent	7	5.9	$0.048	11/21	56.6s
Total Tests 21 Wrong Tests 10 Total Cost $0.048 Response Time (avg) 56.6s
#149	Hy3 preview low	Tencent	7	5.5	$0.015	10/21	24.6s
Total Tests 21 Wrong Tests 11 Total Cost $0.015 Response Time (avg) 24.6s
#175	Ring-2.6-1T none	Inclusionai	6	4.8	$0.026	9/22	55.1s
Total Tests 22 Wrong Tests 13 Total Cost $0.026 Response Time (avg) 55.1s
#203	Nemotron 3 Nano Omni 30b A3b Reasoning medium	NVIDIA	6	3.4	$0.000	4/19	17.1s
Total Tests 19 Wrong Tests 15 Total Cost $0.000 Response Time (avg) 17.1s
#204	Nemotron 3 Nano Omni 30b A3b Reasoning none	NVIDIA	6	3.2	$0.000	2/19	728ms
Total Tests 19 Wrong Tests 17 Total Cost $0.000 Response Time (avg) 728ms
#78	Gemini 3.5 Flash none	Google	4	7.0	$1.079	15/22	9.93s
Total Tests 22 Wrong Tests 7 Total Cost $1.079 Response Time (avg) 9.93s
#129	Gemini 3 PRO Preview medium	Google	4	6.0	$0.385	14/21	9.05s
Total Tests 21 Wrong Tests 7 Total Cost $0.385 Response Time (avg) 9.05s
#136	Nemotron 3 Super medium	NVIDIA	4	5.7	$0.066	8/22	52.0s
Total Tests 22 Wrong Tests 14 Total Cost $0.066 Response Time (avg) 52.0s
#169	DeepSeek V3.2 none	DeepSeek	4	5.0	$0.054	6/22	18.3s
Total Tests 22 Wrong Tests 16 Total Cost $0.054 Response Time (avg) 18.3s
#182	Laguna M.1 medium	Poolside	4	4.7	$0.033	9/19	14.7s
Total Tests 19 Wrong Tests 10 Total Cost $0.033 Response Time (avg) 14.7s
#188	Laguna M.1 none	Poolside	4	4.4	$0.009	4/19	2.89s
Total Tests 19 Wrong Tests 15 Total Cost $0.009 Response Time (avg) 2.89s
#194	Laguna Xs.2 medium	Poolside	4	4.1	$0.015	6/19	6.73s
Total Tests 19 Wrong Tests 13 Total Cost $0.015 Response Time (avg) 6.73s
#195	Hy3 preview none	Tencent	4	4.0	$0.003	4/21	12.9s
Total Tests 21 Wrong Tests 17 Total Cost $0.003 Response Time (avg) 12.9s
#201	Laguna Xs.2 none	Poolside	4	3.8	$0.004	5/19	806ms
Total Tests 19 Wrong Tests 14 Total Cost $0.004 Response Time (avg) 806ms

1 2 3 4 5

→

API error Failures

Filter models

Top Models by API error Count

API error Count vs Score

Top Models by Response Time (avg)