Trivia Model Ranking

AI BENCHY Category

See which AI models perform best on Trivia, which ones stay reliable, and where the biggest gaps appear.

Models Shown

Average Trivia Score

3.1

Best Model

Gemini 3.5 Flash 10.0

Failure Reasons

With failure reason Wrong answer133 With failure reason API error13 With failure reason No answer8

169/169

Rank	Model	Company	Trivia Score	Score	Total Cost	Tests Correct	Response Time (avg)
#1	Gemini 3.5 Flash high	Google	10.0	9.8	$1.115	1/1	3.94s
Total Tests 1 Wrong Tests 0 Total Cost $1.115 Response Time (avg) 3.94s
#2	Gemini 3 Flash Preview medium	Google	10.0	9.6	$0.667	1/1	5.50s
Total Tests 1 Wrong Tests 0 Total Cost $0.667 Response Time (avg) 5.50s
#5	Gemini 3.5 Flash low	Google	10.0	9.2	$0.349	1/1	1.88s
Total Tests 1 Wrong Tests 0 Total Cost $0.349 Response Time (avg) 1.88s
#7	Gemini 3.1 Pro Preview medium	Google	10.0	9.2	$1.054	1/1	6.27s
Total Tests 1 Wrong Tests 0 Total Cost $1.054 Response Time (avg) 6.27s
#8	Gemini 3.5 Flash medium	Google	10.0	9.1	$0.582	1/1	2.75s
Total Tests 1 Wrong Tests 0 Total Cost $0.582 Response Time (avg) 2.75s
#52	Gemini 3 Flash Preview low	Google	10.0	7.4	$0.111	1/1	2.75s
Total Tests 1 Wrong Tests 0 Total Cost $0.111 Response Time (avg) 2.75s
#3	Qwen3.7 Max medium	Qwen	3.0	9.4	$0.523	0/1	33.4s
Total Tests 1 Wrong Tests 1 Total Cost $0.523 Response Time (avg) 33.4s
#4	GPT-5.5 low	OpenAI	3.0	9.3	$0.907	0/1	10.1s
Total Tests 1 Wrong Tests 1 Total Cost $0.907 Response Time (avg) 10.1s
#6	Claude Fable 5 medium	Anthropic	3.0	9.2	$3.165	0/1	25.6s
Total Tests 1 Wrong Tests 1 Total Cost $3.165 Response Time (avg) 25.6s
#11	Qwen3.6 Max Preview medium	Qwen	3.0	8.9	$0.960	0/1	60.6s
Total Tests 1 Wrong Tests 1 Total Cost $0.960 Response Time (avg) 60.6s
#12	Claude Opus 4.8 medium	Anthropic	3.0	8.8	$1.107	0/1	6.14s
Total Tests 1 Wrong Tests 1 Total Cost $1.107 Response Time (avg) 6.14s
#13	Claude Opus 4.7 medium	Anthropic	3.0	8.7	$0.679	0/1	2.25s
Total Tests 1 Wrong Tests 1 Total Cost $0.679 Response Time (avg) 2.25s
#14	GLM 5.2 medium	Z.ai	3.0	8.7	$0.324	0/1	34.2s
Total Tests 1 Wrong Tests 1 Total Cost $0.324 Response Time (avg) 34.2s
#15	GLM 5 medium	Z.ai	3.0	8.6	$0.228	0/1	67.4s
Total Tests 1 Wrong Tests 1 Total Cost $0.228 Response Time (avg) 67.4s
#16	GPT-5 Mini medium	OpenAI	3.0	8.5	$0.159	0/1	9.99s
Total Tests 1 Wrong Tests 1 Total Cost $0.159 Response Time (avg) 9.99s

Trivia Ranking

Filter models

Top Models by Trivia Score

Trivia Score vs Total Cost

Top Models by Response Time (avg)