Anti-AI Tricks Model Ranking

See which AI models perform best on Anti-AI Tricks, which ones stay reliable, and where the biggest gaps appear. Sort by: Tests Correct ↑.

Models Shown

Average Anti-AI Tricks Score

7.2

Best Model

DeepSeek V4 Pro 3.2

Failure Reasons

With failure reason Wrong answer293 With failure reason Did not follow instructions33 With failure reason Extra formatting20 With failure reason API error14 With failure reason No answer4 With failure reason Timed out4

216/216

Rank	Model	Company	Anti-AI Tricks Score	Score	Total Cost	Tests Correct	Response Time (avg)
#127	gpt-oss-120b medium	OpenAI	6.7	6.1	$0.019	2/4	10.2s
Total Tests 4 Wrong Tests 2 Total Cost $0.019 Response Time (avg) 10.2s
#128	Gemini 3.1 Flash Lite none	Google	7.5	6.1	$0.046	2/4	1.07s
Total Tests 4 Wrong Tests 2 Total Cost $0.046 Response Time (avg) 1.07s
#134	GPT-5 Nano medium	OpenAI	6.5	6.1	$0.114	2/4	25.5s
Total Tests 4 Wrong Tests 2 Total Cost $0.114 Response Time (avg) 25.5s
#141	Hy3 preview high	Tencent	6.4	5.9	$0.048	2/4	15.1s
Total Tests 4 Wrong Tests 2 Total Cost $0.048 Response Time (avg) 15.1s
#150	KAT-Coder-Air V2.5 high	Kwaipilot	6.9	5.6	$0.077	2/4	2.49s
Total Tests 4 Wrong Tests 2 Total Cost $0.077 Response Time (avg) 2.49s
#164	KAT-Coder-Air V2.5 low	Kwaipilot	7.3	5.4	$0.041	2/4	3.50s
Total Tests 4 Wrong Tests 2 Total Cost $0.041 Response Time (avg) 3.50s
#178	MiniMax M2.7 medium	Minimax	7.9	5.0	$0.163	2/4	40.3s
Total Tests 4 Wrong Tests 2 Total Cost $0.163 Response Time (avg) 40.3s
#184	Ling-2.6-flash none	Inclusionai	6.8	4.9	$0.002	2/4	11.8s
Total Tests 4 Wrong Tests 2 Total Cost $0.002 Response Time (avg) 11.8s
#187	Grok 4.20 Multi Agent Beta medium	X AI	6.9	4.8	$5.599	2/4	3.46s
Total Tests 4 Wrong Tests 2 Total Cost $5.599 Response Time (avg) 3.46s
#190	Hunter Alpha medium	OpenRouter	7.3	4.7	$0.000	2/4	4.75s
Total Tests 4 Wrong Tests 2 Total Cost $0.000 Response Time (avg) 4.75s
#192	Laguna M.1 medium	Poolside	6.5	4.7	$0.033	2/4	4.87s
Total Tests 4 Wrong Tests 2 Total Cost $0.033 Response Time (avg) 4.87s
#196	MiniMax M2.5 medium	Minimax	7.9	4.6	$0.340	2/4	20.8s
Total Tests 4 Wrong Tests 2 Total Cost $0.340 Response Time (avg) 20.8s
#199	Elephant Alpha none	Openrouter	6.6	4.3	$0.000	2/4	963ms
Total Tests 4 Wrong Tests 2 Total Cost $0.000 Response Time (avg) 963ms
#201	Elephant Alpha medium	Openrouter	6.6	4.3	$0.000	2/4	1.19s
Total Tests 4 Wrong Tests 2 Total Cost $0.000 Response Time (avg) 1.19s
#204	Laguna Xs.2 medium	Poolside	6.9	4.1	$0.015	2/4	2.68s
Total Tests 4 Wrong Tests 2 Total Cost $0.015 Response Time (avg) 2.68s

Anti-AI Tricks Ranking

Filter models

Top Models by Anti-AI Tricks Score

Anti-AI Tricks Score vs Total Cost

Top Models by Response Time (avg)