AI BENCHY

AI BENCHY Category Failures

Trivia: No answer

See which AI models most often fail Trivia tests with No answer, so you can spot weak points faster. Results are sorted by Tests Correct, descending.

Models Shown: 6

Total Failures: 6

Most Affected Model: Claude Opus 4.8 (No answer Count: 1)

Rank | Model | Company | No answer Count | Category Score | Tests Correct | Response Time (avg)
#10 | Claude Opus 4.8 medium | Anthropic | 1 | 3.0 | 0/1 | 6.14s
#22 | Step 3.7 Flash medium | Stepfun | 1 | 3.0 | 0/1 | 114.0s
#57 | Step 3.7 Flash low | Stepfun | 1 | 3.0 | 0/1 | 124.8s
#67 | MiniMax M3 medium | Minimax | 1 | 3.0 | 0/1 | 100.8s
#68 | Claude Opus 4.8 none | Anthropic | 1 | 3.0 | 0/1 | 3.41s
#71 | Step 3.7 Flash high | Stepfun | 1 | 3.0 | 0/1 | 149.3s

Charts (not reproduced here): Top Models by No answer Count; No answer Count vs Score; Top Models by Response Time (avg); Top Models by Estimated Wasted Cost.