AI BENCHY
Advertise here

AI BENCHY Category

General Intelligence Ranking

See which AI models perform best on General Intelligence, which ones stay reliable, and where the biggest gaps appear.

Models Shown

15

Average General Intelligence Score

5.9

Rank Model Company General Intelligence Score Score Tests Correct Response Time (avg)
#117 Qwen3.5-35B-A3B none Qwen 6.5 5.6 0/1 1.19s
#150 Qwen3 Coder Next medium Qwen 6.3 4.6 0/1 1.39s
#17 GLM 5 medium Z.ai 6.1 8.3 0/1 14.7s
#23 GLM 5 Turbo medium Z.ai 6.1 8.0 0/1 10.1s
#30 Qwen3.5-27B medium Qwen 6.1 7.8 0/1 101.4s
#31 DeepSeek V4 Flash high DeepSeek 6.1 7.7 0/1 25.2s
#49 Qwen3.5-Flash medium Qwen 6.1 7.4 0/1 40.1s
#77 Claude Sonnet 4.6 none Anthropic 6.1 6.8 0/1 2.56s
#103 DeepSeek V4 Pro high DeepSeek 6.1 6.0 0/1 25.1s
#116 Hunter Alpha none OpenRouter 6.1 5.7 0/1 2.71s
#84 Grok 4.20 Multi Agent Beta medium X AI 5.8 6.6 0/1 6.40s
#43 MiMo-V2.5-Pro medium Xiaomi 5.5 7.5 0/1 4.02s
#62 Step 3.5 Flash medium Stepfun 5.5 7.2 0/1 22.4s
#71 Step 3.7 Flash high Stepfun 5.5 7.0 0/1 4.17s
#38 Grok 4.3 medium X AI 5.4 7.6 0/1 24.7s

Top Models by General Intelligence Score

General Intelligence Score vs Total Cost

Top Models by Response Time (avg)