AI BENCHY

AI BENCHY Category Failures

Trivia: API error

See which AI models are most likely to hit an API error on Trivia tests, so you can spot weak points faster. Sorted by failure count, ascending.

Models Shown: 12
Total Failures: 12
Most Affected Model: Gemini 3 PRO Preview (1)

| Rank | Model | Company | API error Count | Category Score | Tests Correct | Response Time (avg) |
|---|---|---|---|---|---|---|
| #35 | Gemini 3 PRO Preview medium | Google | 1 | 3.0 | 0/1 | 0ms |
| #92 | Laguna M.1 medium | Poolside | 1 | 3.0 | 0/1 | 0ms |
| #93 | Qwen3.6 Plus Preview medium | Qwen | 1 | 3.0 | 0/1 | 0ms |
| #107 | Laguna Xs.2 medium | Poolside | 1 | 3.0 | 0/1 | 0ms |
| #136 | Elephant Alpha medium | Openrouter | 1 | 3.0 | 0/1 | 0ms |
| #137 | Elephant Alpha none | Openrouter | 1 | 3.0 | 0/1 | 0ms |
| #145 | Laguna M.1 none | Poolside | 1 | 3.0 | 0/1 | 0ms |
| #146 | Laguna Xs.2 none | Poolside | 1 | 3.0 | 0/1 | 0ms |
| #149 | Nemotron 3 Nano Omni 30b A3b Reasoning medium | NVIDIA | 1 | 3.0 | 0/1 | 0ms |
| #159 | Ling-2.6-1T none | Inclusionai | 1 | 3.0 | 0/1 | 0ms |
| #161 | Qwen3.5-9B medium | Qwen | 1 | 3.0 | 0/1 | 177.0s |
| #162 | Nemotron 3 Nano Omni 30b A3b Reasoning none | NVIDIA | 1 | 3.0 | 0/1 | 0ms |
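
The summary figures (Models Shown, Total Failures) can be cross-checked against the table rows. The following is a minimal Python sketch, assuming the rows are transcribed by hand from the table above; `ROWS` and `summarize` are illustrative names and not part of AI BENCHY, and the per-company grouping is an extra view that the page itself does not show.

```python
from collections import Counter

# Rows transcribed by hand from the table above:
# (rank, model, company, api_error_count)
ROWS = [
    (35, "Gemini 3 PRO Preview medium", "Google", 1),
    (92, "Laguna M.1 medium", "Poolside", 1),
    (93, "Qwen3.6 Plus Preview medium", "Qwen", 1),
    (107, "Laguna Xs.2 medium", "Poolside", 1),
    (136, "Elephant Alpha medium", "Openrouter", 1),
    (137, "Elephant Alpha none", "Openrouter", 1),
    (145, "Laguna M.1 none", "Poolside", 1),
    (146, "Laguna Xs.2 none", "Poolside", 1),
    (149, "Nemotron 3 Nano Omni 30b A3b Reasoning medium", "NVIDIA", 1),
    (159, "Ling-2.6-1T none", "Inclusionai", 1),
    (161, "Qwen3.5-9B medium", "Qwen", 1),
    (162, "Nemotron 3 Nano Omni 30b A3b Reasoning none", "NVIDIA", 1),
]


def summarize(rows):
    """Return (models_shown, total_failures, failures_by_company)."""
    models_shown = len(rows)
    total_failures = sum(count for _, _, _, count in rows)
    by_company = Counter()
    for _, _, company, count in rows:
        by_company[company] += count
    return models_shown, total_failures, by_company


models_shown, total_failures, by_company = summarize(ROWS)
print("Models shown:", models_shown)
print("Total failures:", total_failures)
print("Failures by company:", by_company.most_common())
```

Because every listed entry has exactly one API error, the two totals coincide; grouping by company only shows how the entries are distributed across providers.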

[Charts: Top Models by API error Count; API error Count vs Score; Top Models by Response Time (avg); Top Models by Estimated Wasted Cost]