Anti-AI Tricks Model Ranking

See which AI models perform best on Anti-AI Tricks, which ones stay reliable, and where the biggest gaps appear.

Models Shown

Average Anti-AI Tricks Score

7.1

Best Model

LFM2-24B-A2B 2.5

Failure Reasons

Wrong answer: 293
Did not follow instructions: 33
Extra formatting: 20
API error: 14
No answer: 4
Timed out: 4
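
To put the failure counts above in proportion, the following Python sketch computes each reason's share of all recorded failures. It assumes every failed test is counted under exactly one reason, which the page does not state; the names `failure_counts` and `failure_shares` are illustrative and not part of the leaderboard.

```python
# Failure counts as listed under "Failure Reasons" on this page.
# Assumption: each failed test is counted under exactly one reason.
failure_counts = {
    "Wrong answer": 293,
    "Did not follow instructions": 33,
    "Extra formatting": 20,
    "API error": 14,
    "No answer": 4,
    "Timed out": 4,
}


def failure_shares(counts):
    """Return each reason's share of all recorded failures, in percent."""
    total = sum(counts.values())
    return {reason: 100 * n / total for reason, n in counts.items()}


total_failures = sum(failure_counts.values())
shares = failure_shares(failure_counts)

# Print the reasons from most to least frequent.
for reason, pct in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{reason}: {failure_counts[reason]} ({pct:.1f}%)")
```

Under that assumption, the six counts sum to 368 failures, and "Wrong answer" accounts for roughly four in five of them.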

210/210

Rank	Model	Company	Anti-AI Tricks Score	Score	Total Cost	Tests Correct	Response Time (avg)
#210	LFM2-24B-A2B none	Liquid	2.5	2.2	$0.001	0/3	471ms
#116	Seed-2.0-Lite none	Bytedance Seed	3.0	6.2	$0.066	0/4	2.43s
#118	Gemini 2.5 Flash none	Google	3.0	6.2	$0.017	0/4	582ms
#150	DeepSeek V4 Flash none	DeepSeek	3.0	5.6	$0.044	0/4	20.2s
#170	GLM 5 Turbo none	Z.ai	3.0	5.1	$0.047	0/4	2.84s
#171	North Mini Code none	Cohere	3.0	5.1	$0.000	0/4	22.5s
#189	Mercury 2 none	Inception	3.0	4.6	$0.030	0/4	483ms
#205	Laguna Xs.2 none	Poolside	3.0	3.8	$0.004	0/4	534ms
#124	Qwen3.6 Flash none	Qwen	3.1	6.1	$0.062	0/4	1.63s
#183	Trinity Large Preview none	Arcee AI	3.1	4.8	$0.008	0/4	2.07s
#169	Qwen3.5-9B none	Qwen	3.1	5.1	$0.021	0/4	1.71s
#136	GPT-5.4 Mini none	OpenAI	3.1	5.9	$0.095	0/4	929ms
#203	Grok 4.1 Fast none	X AI	3.2	3.8	$0.008	0/4	1.07s
#139	GPT-5.4 none	OpenAI	3.2	5.8	$0.397	0/4	1.21s
#173	DeepSeek V3.2 none	DeepSeek	3.2	5.0	$0.054	0/4	9.35s

Anti-AI Tricks Ranking

Charts: Top Models by Anti-AI Tricks Score; Anti-AI Tricks Score vs Total Cost; Top Models by Response Time (avg)