Anti-AI Tricks Model Ranking

See which AI models perform best on Anti-AI Tricks, which ones stay reliable, and where the biggest gaps appear. The table below is sorted by average response time, slowest first.

Models Shown

216/216

Average Anti-AI Tricks Score

7.2

Best Model

Seed-2.0-Mini 6.6

Failure Reasons

Wrong answer: 293
Did not follow instructions: 33
Extra formatting: 20
API error: 14
No answer: 4
Timed out: 4
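To put the failure-reason counts above in proportion, each count can be divided by the total number of failures. The following is a minimal sketch, not part of the ranking page itself; it assumes that every failed test carries exactly one failure reason, so the counts can be summed without double counting. The names `failure_counts` and `failure_shares` are illustrative.

```python
# Failure-reason counts copied from the summary above.
# Assumption: each failed test is tagged with exactly one reason.
failure_counts = {
    "Wrong answer": 293,
    "Did not follow instructions": 33,
    "Extra formatting": 20,
    "API error": 14,
    "No answer": 4,
    "Timed out": 4,
}


def failure_shares(counts):
    """Return each reason's fraction of all recorded failures."""
    total = sum(counts.values())
    return {reason: count / total for reason, count in counts.items()}


total_failures = sum(failure_counts.values())
shares = failure_shares(failure_counts)

print(f"Total failures: {total_failures}")
for reason, share in shares.items():
    print(f"{reason}: {share:.1%}")
```

Under that assumption, wrong answers account for roughly four out of five recorded failures, and the remaining reasons together make up the rest.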

Overall Rank	Model	Company	Anti-AI Tricks Score	Overall Score	Total Cost	Tests Correct	Response Time (avg)
#73	KAT-Coder-Pro V2.5 high	Kwaipilot	7.0	7.2	$0.482	2/4	3.17s
#137	Grok 4.20 Beta medium	X AI	8.7	6.0	$0.750	3/4	3.16s
#151	GLM 5V Turbo none	Z.ai	4.8	5.6	$0.052	1/4	3.13s
#48	GPT-5.6 Luna high	OpenAI	8.3	7.7	$1.017	3/4	2.99s
#44	Claude Sonnet 4.6 medium	Anthropic	6.5	7.8	$2.057	2/4	2.98s
#67	Claude Sonnet 4.6 none	Anthropic	4.8	7.3	$0.661	1/4	2.94s
#117	LongCat 2.0 none	Meituan	4.8	6.3	$0.044	1/4	2.87s
#115	Mimo V2 PRO medium	Xiaomi	10.0	6.3	$0.333	4/4	2.86s
#176	GLM 5 Turbo none	Z.ai	3.0	5.1	$0.047	0/4	2.84s
#158	Qwen3.6 27B none	Qwen	3.8	5.5	$0.087	0/4	2.83s
#7	GPT-5.6 Sol medium	OpenAI	10.0	9.4	$1.316	4/4	2.81s
#154	Owl Alpha none	Openrouter	3.4	5.6	$0.000	0/4	2.78s
#23	Grok 4.5 low	X AI	10.0	8.4	$0.935	4/4	2.75s
#140	Mimo V2 Omni medium	Xiaomi	10.0	5.9	$0.683	4/4	2.75s
#188	KAT-Coder-Air V2.5 none	Kwaipilot	5.3	4.8	$0.067	1/4	2.68s
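The table is ordered by response time, so the strongest Anti-AI Tricks results are scattered through it. As a rough sketch of how the rows could be re-sorted by Anti-AI Tricks score instead, the snippet below parses a few tab-separated rows copied from the table. The `Row` class, its field names, and the helper functions are hypothetical, and reading the second score column as an overall score is an assumption.

```python
from dataclasses import dataclass

# Four rows copied from the table above, tab-separated in the same column order.
SAMPLE_ROWS = (
    "#137\tGrok 4.20 Beta medium\tX AI\t8.7\t6.0\t$0.750\t3/4\t3.16s\n"
    "#115\tMimo V2 PRO medium\tXiaomi\t10.0\t6.3\t$0.333\t4/4\t2.86s\n"
    "#176\tGLM 5 Turbo none\tZ.ai\t3.0\t5.1\t$0.047\t0/4\t2.84s\n"
    "#7\tGPT-5.6 Sol medium\tOpenAI\t10.0\t9.4\t$1.316\t4/4\t2.81s\n"
)


@dataclass
class Row:
    rank: int
    model: str
    company: str
    trick_score: float    # "Anti-AI Tricks Score" column
    overall_score: float  # second score column (assumed to be the overall score)
    total_cost: float     # US dollars
    correct: int
    total: int
    response_s: float     # average response time in seconds

    @property
    def accuracy(self) -> float:
        """Fraction of Anti-AI Tricks tests answered correctly."""
        return self.correct / self.total


def parse_rows(text):
    """Parse tab-separated table rows, skipping lines without 8 fields."""
    rows = []
    for line in text.splitlines():
        fields = line.split("\t")
        if len(fields) != 8:
            continue
        correct, total = fields[6].split("/")
        rows.append(Row(
            rank=int(fields[0].lstrip("#")),
            model=fields[1],
            company=fields[2],
            trick_score=float(fields[3]),
            overall_score=float(fields[4]),
            total_cost=float(fields[5].lstrip("$")),
            correct=int(correct),
            total=int(total),
            response_s=float(fields[7].rstrip("s")),
        ))
    return rows


def by_trick_score(rows):
    """Highest Anti-AI Tricks score first; faster model wins ties."""
    return sorted(rows, key=lambda r: (-r.trick_score, r.response_s))


ranked = by_trick_score(parse_rows(SAMPLE_ROWS))
for r in ranked:
    print(f"{r.model}: {r.trick_score} ({r.correct}/{r.total}, {r.response_s}s)")
```

Breaking ties on response time is one arbitrary choice; with only four tests per model, many models share the same score, so any secondary key changes the order noticeably.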

Charts (not reproduced here): Top Models by Anti-AI Tricks Score; Anti-AI Tricks Score vs Total Cost; Top Models by Response Time (avg).