AI BENCHY

General Intelligence Ranking

See which AI models perform best on General Intelligence, which ones stay reliable, and where the biggest gaps appear. The table is sorted by average response time, slowest first.

Models Shown: 15

Average General Intelligence Score: 6.1

Best Model: Qwen3.5-9B (2.8)

| Rank | Model | Company | General Intelligence Score | Score | Tests Correct | Response Time (avg) |
|------|-------|---------|----------------------------|-------|---------------|---------------------|
| #97 | Qwen3.5-9B medium | Qwen | 2.8 | 4.4 | 0/1 | 226.4s |
| #10 | Qwen3.5-27B medium | Qwen | 6.1 | 8.4 | 0/1 | 101.4s |
| #8 | Qwen3.5 Plus 2026-02-15 medium | Qwen | 4.7 | 8.5 | 0/1 | 79.9s |
| #46 | Kimi K2.5 medium | Moonshot AI | 6.5 | 7.0 | 0/1 | 69.7s |
| #32 | Qwen3.5-Flash medium | Qwen | 6.1 | 7.8 | 0/1 | 40.1s |
| #80 | MiniMax M2.7 medium | Minimax | 3.9 | 5.3 | 0/1 | 38.7s |
| #39 | Seed-2.0-Mini medium | Bytedance Seed | 5.1 | 7.5 | 0/1 | 36.7s |
| #19 | Qwen3.5-122B-A10B medium | Qwen | 3.4 | 8.1 | 0/1 | 34.1s |
| #27 | DeepSeek V3.2 medium | DeepSeek | 5.4 | 8.0 | 0/1 | 31.3s |
| #43 | Qwen3.5-35B-A3B medium | Qwen | 2.8 | 7.4 | 0/1 | 30.3s |
| #24 | Gemma 4 26B A4B medium | Google | 10.0 | 8.0 | 1/1 | 29.8s |
| #51 | Nemotron 3 Super medium | NVIDIA | 3.8 | 6.7 | 0/1 | 27.9s |
| #9 | Qwen3.6 Plus Preview medium | Qwen | 5.1 | 8.5 | 0/1 | 27.1s |
| #20 | Qwen3.6 Plus medium | Qwen | 5.1 | 8.1 | 0/1 | 27.1s |
| #88 | Nemotron 3 Super none | NVIDIA | 4.2 | 5.1 | 0/1 | 25.0s |
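
The ordering used in the table (average response time, slowest first) can be reproduced from a handful of its rows. The following is only a minimal sketch: the four row tuples are copied from the table above, while the names `rows`, `by_response_time`, and `by_score` are hypothetical helpers introduced for illustration and are not part of the site.

```python
# Rows copied from the table above:
# (model, general_intelligence_score, avg_response_time_seconds)
rows = [
    ("Gemma 4 26B A4B medium", 10.0, 29.8),
    ("Qwen3.5-9B medium", 2.8, 226.4),
    ("Kimi K2.5 medium", 6.5, 69.7),
    ("Nemotron 3 Super none", 4.2, 25.0),
]


def by_response_time(rows, descending=True):
    """Return a new list sorted by average response time (hypothetical helper)."""
    return sorted(rows, key=lambda r: r[2], reverse=descending)


def by_score(rows):
    """Return a new list sorted by General Intelligence score, highest first."""
    return sorted(rows, key=lambda r: r[1], reverse=True)


# Slowest first, matching the sort order of the table.
for model, score, seconds in by_response_time(rows):
    print(f"{model:<26} {score:>5.1f} {seconds:>7.1f}s")
```

Sorting the same four rows with `by_score` instead places Gemma 4 26B A4B first, which shows how much the order of a leaderboard depends on the selected sort column.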

[Charts: Top Models by General Intelligence Score; General Intelligence Score vs Total Cost; Top Models by Response Time (avg)]