AI BENCHY

Anti-AI Tricks Ranking

See which AI models perform best on Anti-AI Tricks, which ones stay reliable, and where the biggest gaps appear. Rows are sorted by average response time, fastest first.

Models Shown: 15

Average Anti-AI Tricks Score: 6.9

| Rank | Model | Company | Anti-AI Tricks Score | Score | Tests Correct | Response Time (avg) |
|------|-------|---------|----------------------|-------|---------------|---------------------|
| #33 | Hy3 preview medium | Tencent | 10.0 | 7.7 | 4/4 | 6.59s |
| #47 | Grok Build 0.1 medium | X AI | 8.3 | 7.4 | 3/4 | 7.43s |
| #69 | Claude Opus 4.6 medium | Anthropic | 6.4 | 7.0 | 2/4 | 7.45s |
| #42 | GPT-5.2 medium | OpenAI | 6.5 | 7.5 | 2/4 | 7.81s |
| #105 | Nemotron 3 Super medium | NVIDIA | 8.3 | 5.8 | 3/4 | 7.85s |
| #4 | Gemini 3.1 Pro Preview medium | Google | 10.0 | 9.4 | 4/4 | 7.90s |
| #55 | GLM 5.1 medium | Z.ai | 10.0 | 7.3 | 4/4 | 8.31s |
| #18 | Qwen3.7 Plus medium | Qwen | 10.0 | 8.2 | 4/4 | 8.58s |
| #41 | Nemotron 3 Ultra 550b A55b medium | NVIDIA | 10.0 | 7.5 | 4/4 | 8.62s |
| #150 | Qwen3 Coder Next medium | Qwen | 3.5 | 4.6 | 0/4 | 8.64s |
| #38 | Grok 4.3 medium | X AI | 10.0 | 7.6 | 4/4 | 8.83s |
| #89 | Hy3 preview low | Tencent | 8.3 | 6.4 | 3/4 | 9.32s |
| #133 | DeepSeek V3.2 none | DeepSeek | 3.2 | 5.2 | 0/4 | 9.35s |
| #22 | Step 3.7 Flash medium | Stepfun | 8.7 | 8.0 | 3/4 | 9.65s |
| #29 | Qwen3.5-122B-A10B medium | Qwen | 10.0 | 7.8 | 4/4 | 9.75s |
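
Because the table is ordered by response time rather than by score, it can help to re-sort it when comparing models. The following is a minimal sketch, not part of the site: the `Row` type and both helper functions are hypothetical names introduced here, and only five rows are copied from the table for brevity.

```python
from typing import NamedTuple


class Row(NamedTuple):
    # Hypothetical record type; field names are illustrative, not the site's schema.
    model: str
    tricks_score: float     # "Anti-AI Tricks Score" column
    tests_correct: int      # numerator of "Tests Correct"
    tests_total: int        # denominator of "Tests Correct"
    response_time_s: float  # "Response Time (avg)" in seconds


# A subset of rows copied from the table above.
ROWS = [
    Row("Hy3 preview medium", 10.0, 4, 4, 6.59),
    Row("Grok Build 0.1 medium", 8.3, 3, 4, 7.43),
    Row("Gemini 3.1 Pro Preview medium", 10.0, 4, 4, 7.90),
    Row("Qwen3 Coder Next medium", 3.5, 0, 4, 8.64),
    Row("DeepSeek V3.2 none", 3.2, 0, 4, 9.35),
]


def fastest_perfect(rows):
    """Keep models that passed every test, fastest first."""
    perfect = [r for r in rows if r.tests_correct == r.tests_total]
    return sorted(perfect, key=lambda r: r.response_time_s)


def by_score_then_speed(rows):
    """Sort by Anti-AI Tricks Score (high to low), breaking ties by speed."""
    return sorted(rows, key=lambda r: (-r.tricks_score, r.response_time_s))


if __name__ == "__main__":
    for r in by_score_then_speed(ROWS):
        print(f"{r.model}: {r.tricks_score} in {r.response_time_s}s")
```

Sorting on a tuple key keeps the tie-break explicit: models with the same score are ordered by their average response time.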

[Chart: Top Models by Anti-AI Tricks Score]

[Chart: Anti-AI Tricks Score vs Total Cost]

[Chart: Top Models by Response Time (avg)]