Anti-AI Tricks Model Ranking

See which AI models perform best on Anti-AI Tricks, which ones stay reliable, and where the biggest gaps appear. The table below is sorted by Anti-AI Tricks Score in ascending order, so the lowest-scoring models come first.

Models Shown: 210/210

Average Anti-AI Tricks Score: 7.1

Best Model: LFM2-24B-A2B 2.5

Failure Reasons

Wrong answer: 293
Did not follow instructions: 33
Extra formatting: 20
API error: 14
No answer: 4
Timed out: 4
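To put the failure-reason counts above in proportion, the following minimal sketch totals them and prints each reason's share. The counts are copied from this page; the names `FAILURE_COUNTS` and `failure_shares` are illustrative and not part of any leaderboard API.

```python
# Failure-reason counts as listed on this page.
FAILURE_COUNTS = {
    "Wrong answer": 293,
    "Did not follow instructions": 33,
    "Extra formatting": 20,
    "API error": 14,
    "No answer": 4,
    "Timed out": 4,
}


def failure_shares(counts):
    """Return each reason's fraction of all recorded failures."""
    total = sum(counts.values())
    return {reason: n / total for reason, n in counts.items()}


if __name__ == "__main__":
    print(f"Total failures: {sum(FAILURE_COUNTS.values())}")
    for reason, share in failure_shares(FAILURE_COUNTS).items():
        # The "%" presentation type multiplies by 100 and appends a percent sign.
        print(f"{reason}: {share:.1%}")
```

On these numbers, wrong answers account for roughly four out of five recorded failures.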

Rank	Model	Company	Anti-AI Tricks Score	Score	Total Cost	Tests Correct	Response Time (avg)
#200	MiMo-V2-Flash none	Xiaomi	3.2	4.0	$0.025	0/4	1.19s
#82	DeepSeek V4 Pro none	DeepSeek	3.2	6.9	$0.096	0/4	4.02s
#154	MiMo-V2.5-Pro none	Xiaomi	3.3	5.5	$0.068	0/4	2.67s
#162	Ling-2.6-1T none	Inclusionai	3.4	5.3	$0.016	0/4	6.55s
#127	Qwen3.5-35B-A3B none	Qwen	3.4	6.1	$0.106	0/4	1.43s
#148	Owl Alpha none	OpenRouter	3.4	5.6	$0.000	0/4	2.78s
#165	Mistral Small 4 none	Mistral	3.4	5.1	$0.022	0/4	395ms
#192	Laguna M.1 none	Poolside	3.4	4.4	$0.009	0/4	705ms
#125	Qwen3.5-Flash none	Qwen	3.5	6.1	$0.073	0/4	1.32s
#187	Qwen3 Coder Next medium	Qwen	3.5	4.7	$0.032	0/4	8.64s
#129	Nemotron 3 Ultra none	NVIDIA	3.5	6.1	$0.095	0/4	2.35s
#147	Mimo V2 PRO none	Xiaomi	3.5	5.6	$0.045	0/4	1.80s
#168	MiMo-V2.5 none	Xiaomi	3.5	5.1	$0.025	0/4	2.19s
#180	GPT-5.4 Nano none	OpenAI	3.5	4.8	$0.041	0/4	1.18s
#196	Hunter Alpha none	OpenRouter	3.5	4.2	$0.000	0/4	3.81s
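The Response Time (avg) column mixes units (`395ms` next to `1.19s`), so comparing rows takes a small normalization step. The sketch below parses a few tab-separated rows copied from the table above and converts every response time to seconds. The helper names `parse_seconds` and `parse_row` and the dictionary keys are hypothetical, chosen for illustration; the sketch assumes the eight-column layout shown in the header row.

```python
# Three rows copied from the table above, with tab-separated columns.
ROWS = [
    "#200\tMiMo-V2-Flash none\tXiaomi\t3.2\t4.0\t$0.025\t0/4\t1.19s",
    "#165\tMistral Small 4 none\tMistral\t3.4\t5.1\t$0.022\t0/4\t395ms",
    "#187\tQwen3 Coder Next medium\tQwen\t3.5\t4.7\t$0.032\t0/4\t8.64s",
]


def parse_seconds(text):
    """Convert a duration such as '395ms' or '1.19s' to seconds."""
    text = text.strip()
    if text.endswith("ms"):  # check 'ms' first, since it also ends with 's'
        return float(text[:-2]) / 1000.0
    if text.endswith("s"):
        return float(text[:-1])
    raise ValueError(f"unrecognized duration: {text!r}")


def parse_row(line):
    """Split one table row into typed fields (assumes eight columns)."""
    rank, model, company, tricks, score, cost, correct, rt = line.split("\t")
    passed, total = correct.split("/")
    return {
        "rank": int(rank.lstrip("#")),
        "model": model,
        "company": company,
        "tricks_score": float(tricks),
        "score": float(score),
        "cost_usd": float(cost.lstrip("$")),
        "tests_correct": int(passed),
        "tests_total": int(total),
        "response_s": parse_seconds(rt),
    }


if __name__ == "__main__":
    rows = [parse_row(line) for line in ROWS]
    # Sort fastest first once all durations share one unit.
    for row in sorted(rows, key=lambda r: r["response_s"]):
        print(f'{row["model"]}: {row["response_s"]:.3f}s')
```

Checking for the `ms` suffix before the `s` suffix matters, because a millisecond value would otherwise be misread as seconds with a stray `m`.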

Charts (not shown): Top Models by Anti-AI Tricks Score; Anti-AI Tricks Score vs Total Cost; Top Models by Response Time (avg).