Anti-AI Tricks Model Ranking

See which AI models perform best on Anti-AI Tricks, which ones stay reliable, and where the biggest gaps appear. Sort by: Tests Correct ↓.

Models Shown

Average Anti-AI Tricks Score

7.1

Best Model

Gemini 3 Flash Preview 10.0

Failure Reasons

With failure reason Wrong answer293 With failure reason Did not follow instructions33 With failure reason Extra formatting20 With failure reason API error14 With failure reason No answer4 With failure reason Timed out4

210/210

Rank	Model	Company	Anti-AI Tricks Score	Score	Total Cost	Tests Correct	Response Time (avg)
#22	Grok 4.5 medium	X AI	10.0	8.3	$1.928	4/4	23.5s
Total Tests 4 Wrong Tests 0 Total Cost $1.928 Response Time (avg) 23.5s
#23	Claude Sonnet 5 medium	Anthropic	10.0	8.3	$0.922	4/4	3.80s
Total Tests 4 Wrong Tests 0 Total Cost $0.922 Response Time (avg) 3.80s
#28	Inkling high	Thinkingmachines	10.0	8.0	$1.006	4/4	12.7s
Total Tests 4 Wrong Tests 0 Total Cost $1.006 Response Time (avg) 12.7s
#31	GLM 5.2 high	Z.ai	10.0	8.0	$0.970	4/4	5.80s
Total Tests 4 Wrong Tests 0 Total Cost $0.970 Response Time (avg) 5.80s
#33	Kimi K3 max	Moonshot AI	10.0	8.0	$3.112	4/4	10.2s
Total Tests 4 Wrong Tests 0 Total Cost $3.112 Response Time (avg) 10.2s
#36	Qwen3.7 Plus medium	Qwen	10.0	7.9	$0.267	4/4	8.58s
Total Tests 4 Wrong Tests 0 Total Cost $0.267 Response Time (avg) 8.58s
#37	Qwen3.6 Plus medium	Qwen	10.0	7.8	$0.405	4/4	9.90s
Total Tests 4 Wrong Tests 0 Total Cost $0.405 Response Time (avg) 9.90s
#38	GLM 5.2 medium	Z.ai	10.0	7.8	$0.222	4/4	5.89s
Total Tests 4 Wrong Tests 0 Total Cost $0.222 Response Time (avg) 5.89s
#41	Claude Opus 4.8 low	Anthropic	10.0	7.8	$2.077	4/4	3.30s
Total Tests 4 Wrong Tests 0 Total Cost $2.077 Response Time (avg) 3.30s
#42	GLM 5 medium	Z.ai	10.0	7.7	$0.307	4/4	23.7s
Total Tests 4 Wrong Tests 0 Total Cost $0.307 Response Time (avg) 23.7s
#49	GLM 5 Turbo medium	Z.ai	10.0	7.6	$0.323	4/4	4.82s
Total Tests 4 Wrong Tests 0 Total Cost $0.323 Response Time (avg) 4.82s
#51	Nemotron 3 Ultra medium	NVIDIA	10.0	7.5	$0.774	4/4	8.62s
Total Tests 4 Wrong Tests 0 Total Cost $0.774 Response Time (avg) 8.62s
#60	LongCat 2.0 medium	Meituan	10.0	7.4	$0.478	4/4	9.65s
Total Tests 4 Wrong Tests 0 Total Cost $0.478 Response Time (avg) 9.65s
#61	Gemini 3 Flash Preview low	Google	10.0	7.4	$0.177	4/4	3.48s
Total Tests 4 Wrong Tests 0 Total Cost $0.177 Response Time (avg) 3.48s
#70	Qwen3.5 Plus 2026-04-20 medium	Qwen	10.0	7.2	$0.317	4/4	10.8s
Total Tests 4 Wrong Tests 0 Total Cost $0.317 Response Time (avg) 10.8s

Anti-AI Tricks Ranking

Filter models

Top Models by Anti-AI Tricks Score

Anti-AI Tricks Score vs Total Cost

Top Models by Response Time (avg)