Anti-AI Tricks Model Ranking

See which AI models perform best on Anti-AI Tricks, which ones stay reliable, and where the biggest gaps appear. Sort by: Tests Correct ↓.

Models Shown

Average Anti-AI Tricks Score

7.1

Best Model

Gemini 3 Flash Preview 10.0

Failure Reasons

With failure reason Wrong answer293 With failure reason Did not follow instructions33 With failure reason Extra formatting20 With failure reason API error14 With failure reason No answer4 With failure reason Timed out4

210/210

Rank	Model	Company	Anti-AI Tricks Score	Score	Total Cost	Tests Correct	Response Time (avg)
#72	Qwen3.5-122B-A10B medium	Qwen	10.0	7.1	$1.046	4/4	9.75s
Total Tests 4 Wrong Tests 0 Total Cost $1.046 Response Time (avg) 9.75s
#73	Grok 4.3 medium	X AI	10.0	7.1	$0.779	4/4	8.83s
Total Tests 4 Wrong Tests 0 Total Cost $0.779 Response Time (avg) 8.83s
#74	GLM 5.1 medium	Z.ai	10.0	7.1	$0.535	4/4	8.31s
Total Tests 4 Wrong Tests 0 Total Cost $0.535 Response Time (avg) 8.31s
#79	Gemini 3.5 Flash none	Google	10.0	7.0	$1.079	4/4	2.53s
Total Tests 4 Wrong Tests 0 Total Cost $1.079 Response Time (avg) 2.53s
#84	MiMo-V2.5-Pro medium	Xiaomi	10.0	6.9	$0.187	4/4	3.26s
Total Tests 4 Wrong Tests 0 Total Cost $0.187 Response Time (avg) 3.26s
#85	Qwen3.6 Flash medium	Qwen	10.0	6.9	$0.738	4/4	6.10s
Total Tests 4 Wrong Tests 0 Total Cost $0.738 Response Time (avg) 6.10s
#86	Step 3.7 Flash high	Stepfun	10.0	6.9	$1.207	4/4	13.4s
Total Tests 4 Wrong Tests 0 Total Cost $1.207 Response Time (avg) 13.4s
#90	Qwen3.6 35B A3B medium	Qwen	10.0	6.7	$0.746	4/4	6.02s
Total Tests 4 Wrong Tests 0 Total Cost $0.746 Response Time (avg) 6.02s
#91	LongCat 2.0 low	Meituan	10.0	6.7	$0.391	4/4	9.04s
Total Tests 4 Wrong Tests 0 Total Cost $0.391 Response Time (avg) 9.04s
#95	Gemma 4 26B A4B medium	Google	10.0	6.6	$0.089	4/4	6.20s
Total Tests 4 Wrong Tests 0 Total Cost $0.089 Response Time (avg) 6.20s
#100	Hy3 preview medium	Tencent	10.0	6.5	$0.018	4/4	6.59s
Total Tests 4 Wrong Tests 0 Total Cost $0.018 Response Time (avg) 6.59s
#101	MiMo-V2.5 medium	Xiaomi	10.0	6.5	$0.082	4/4	4.14s
Total Tests 4 Wrong Tests 0 Total Cost $0.082 Response Time (avg) 4.14s
#108	Ring-2.6-1T medium	Inclusionai	10.0	6.3	$0.103	4/4	42.2s
Total Tests 4 Wrong Tests 0 Total Cost $0.103 Response Time (avg) 42.2s
#109	Mimo V2 PRO medium	Xiaomi	10.0	6.3	$0.333	4/4	2.86s
Total Tests 4 Wrong Tests 0 Total Cost $0.333 Response Time (avg) 2.86s
#110	Gemma 4 31B medium	Google	10.0	6.3	$0.163	4/4	12.9s
Total Tests 4 Wrong Tests 0 Total Cost $0.163 Response Time (avg) 12.9s

Anti-AI Tricks Ranking

Filter models

Top Models by Anti-AI Tricks Score

Anti-AI Tricks Score vs Total Cost

Top Models by Response Time (avg)