AI BENCHY
Advertise here

AI BENCHY Category

Anti-AI Tricks Ranking

See which AI models perform best on Anti-AI Tricks, which ones stay reliable, and where the biggest gaps appear.

Models Shown

15

Average Anti-AI Tricks Score

6.9

Rank Model Company Anti-AI Tricks Score Score Tests Correct Response Time (avg)
#127 Grok 4.20 none X AI 4.8 5.4 1/4 501ms
#131 Qwen3.5-122B-A10B none Qwen 4.8 5.3 1/4 1.59s
#147 GPT-4o-mini none OpenAI 4.8 4.8 1/4 1.34s
#156 Hy3 preview none Tencent 4.8 4.4 1/4 11.1s
#162 Nemotron 3 Nano Omni 30b A3b Reasoning none NVIDIA 4.8 4.1 1/4 584ms
#158 GLM 4.7 Flash medium Z.ai 4.7 4.4 1/4 15.0s
#124 Kimi K2.6 none Moonshot AI 4.6 5.5 1/4 1.39s
#106 Grok 4.20 Beta none X AI 4.0 5.8 0/4 597ms
#112 GLM 5.1 none Z.ai 4.0 5.7 0/4 2.11s
#118 Qwen3.6 27B none Qwen 3.8 5.6 0/4 2.83s
#101 Mimo V2 Omni none Xiaomi 3.6 6.0 0/4 1.63s
#153 Qwen3.6 35B A3B none Qwen 3.6 4.6 0/4 2.10s
#135 Kimi K2.5 none Moonshot AI 3.6 5.2 0/4 6.24s
#140 Qwen3 Coder Next none Qwen 3.6 4.9 0/4 3.31s
#104 Nemotron 3 Ultra 550b A55b none NVIDIA 3.5 6.0 0/4 2.35s

Top Models by Anti-AI Tricks Score

Anti-AI Tricks Score vs Total Cost

Top Models by Response Time (avg)