AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Category

General Intelligence Ranking

See which AI models perform best on General Intelligence, which ones stay reliable, and where the biggest gaps appear.

Models Shown

15

Average General Intelligence Score

6.1

Rank Model Company General Intelligence Score Score Tests Correct Response Time (avg)
#92 Qwen3 Coder Next medium Qwen 6.3 4.7 0/1 1.39s
#10 Qwen3.5-27B medium Qwen 6.1 8.4 0/1 101.4s
#13 GLM 5 medium Z.ai 6.1 8.4 0/1 14.7s
#18 GLM 5 Turbo medium Z.ai 6.1 8.1 0/1 10.1s
#32 Qwen3.5-Flash medium Qwen 6.1 7.8 0/1 40.1s
#42 Claude Sonnet 4.6 none Anthropic 6.1 7.4 0/1 2.56s
#72 Hunter Alpha none OpenRouter 6.1 5.7 0/1 2.71s
#47 Grok 4.20 medium X AI 5.8 7.0 0/1 7.09s
#56 Grok 4.20 Multi Agent Beta medium X AI 5.8 6.4 0/1 6.40s
#30 Step 3.5 Flash medium Stepfun 5.5 7.9 0/1 6.54s
#27 DeepSeek V3.2 medium DeepSeek 5.4 8.0 0/1 31.3s
#69 Kimi K2.6 none Moonshot AI 5.4 5.8 0/1 1.55s
#39 Seed-2.0-Mini medium Bytedance Seed 5.1 7.5 0/1 36.7s
#9 Qwen3.6 Plus Preview medium Qwen 5.1 8.5 0/1 27.1s
#20 Qwen3.6 Plus medium Qwen 5.1 8.1 0/1 27.1s

Top Models by General Intelligence Score

General Intelligence Score vs Total Cost

Top Models by Response Time (avg)