Anti-AI Tricks Model Ranking

See which AI models perform best on Anti-AI Tricks, which ones stay reliable, and where the biggest gaps appear. Sort by: Metric ↑.

Models Shown

Average Anti-AI Tricks Score

7.2

Best Model

LFM2-24B-A2B 2.5

Failure Reasons

With failure reason Wrong answer293 With failure reason Did not follow instructions33 With failure reason Extra formatting20 With failure reason API error14 With failure reason No answer4 With failure reason Timed out4

216/216

Rank	Model	Company	Anti-AI Tricks Score	Score	Total Cost	Tests Correct	Response Time (avg)
#91	GPT-5.5 none	OpenAI	6.9	6.9	$0.544	2/4	1.31s
Total Tests 4 Wrong Tests 2 Total Cost $0.544 Response Time (avg) 1.31s
#204	Laguna Xs.2 medium	Poolside	6.9	4.1	$0.015	2/4	2.68s
Total Tests 4 Wrong Tests 2 Total Cost $0.015 Response Time (avg) 2.68s
#72	Kimi K2.6 medium	Moonshot AI	7.0	7.2	$1.036	2/4	11.6s
Total Tests 4 Wrong Tests 2 Total Cost $1.036 Response Time (avg) 11.6s
#73	KAT-Coder-Pro V2.5 high	Kwaipilot	7.0	7.2	$0.482	2/4	3.17s
Total Tests 4 Wrong Tests 2 Total Cost $0.482 Response Time (avg) 3.17s
#29	GPT-5 Mini medium	OpenAI	7.1	8.1	$0.237	2/4	13.9s
Total Tests 4 Wrong Tests 2 Total Cost $0.237 Response Time (avg) 13.9s
#98	GLM 5V Turbo medium	Z.ai	7.2	6.7	$0.457	2/4	10.8s
Total Tests 4 Wrong Tests 2 Total Cost $0.457 Response Time (avg) 10.8s
#111	Gemini 3.1 Flash Lite low	Google	7.3	6.5	$0.621	2/4	1.84s
Total Tests 4 Wrong Tests 2 Total Cost $0.621 Response Time (avg) 1.84s
#56	Kimi K2.7 Code medium	Moonshot AI	7.3	7.5	$0.740	2/4	11.6s
Total Tests 4 Wrong Tests 2 Total Cost $0.740 Response Time (avg) 11.6s
#81	Kimi K2.5 medium	Moonshot AI	7.3	7.0	$0.600	2/4	51.4s
Total Tests 4 Wrong Tests 2 Total Cost $0.600 Response Time (avg) 51.4s
#164	KAT-Coder-Air V2.5 low	Kwaipilot	7.3	5.4	$0.041	2/4	3.50s
Total Tests 4 Wrong Tests 2 Total Cost $0.041 Response Time (avg) 3.50s
#190	Hunter Alpha medium	OpenRouter	7.3	4.7	$0.000	2/4	4.75s
Total Tests 4 Wrong Tests 2 Total Cost $0.000 Response Time (avg) 4.75s
#30	Muse Spark 1.1 high	Meta	7.5	8.1	$1.694	2/4	8.60s
Total Tests 4 Wrong Tests 2 Total Cost $1.694 Response Time (avg) 8.60s
#112	Gemini 3.1 Flash Lite Preview none	Google	7.5	6.4	$0.052	2/4	1.04s
Total Tests 4 Wrong Tests 2 Total Cost $0.052 Response Time (avg) 1.04s
#169	Gemini 3.1 Flash Lite Preview high	Google	7.5	5.3	$2.310	3/3	43.9s
Total Tests 3 Wrong Tests 0 Total Cost $2.310 Response Time (avg) 43.9s
#128	Gemini 3.1 Flash Lite none	Google	7.5	6.1	$0.046	2/4	1.07s
Total Tests 4 Wrong Tests 2 Total Cost $0.046 Response Time (avg) 1.07s

Anti-AI Tricks Ranking

Filter models

Top Models by Anti-AI Tricks Score

Anti-AI Tricks Score vs Total Cost

Top Models by Response Time (avg)