AI BENCHY
Advertise here

AI BENCHY Category

General Intelligence Ranking

See which AI models perform best on General Intelligence, which ones stay reliable, and where the biggest gaps appear. Sort by: Tests Correct ↓.

Models Shown

15

Average General Intelligence Score

5.9

Rank Model Company General Intelligence Score Score Tests Correct Response Time (avg)
#66 Qwen3.5-35B-A3B medium Qwen 2.8 7.1 0/1 30.3s
#67 MiniMax M3 medium Minimax 5.1 7.1 0/1 33.3s
#70 GPT-5.4 Nano medium OpenAI 4.5 7.0 0/1 4.15s
#71 Step 3.7 Flash high Stepfun 5.5 7.0 0/1 4.17s
#72 DeepSeek V3.2 medium DeepSeek 3.4 7.0 0/1 58.3s
#73 Seed-2.0-Mini medium Bytedance Seed 5.1 6.9 0/1 36.7s
#74 Qwen3.6 Max Preview none Qwen 4.3 6.9 0/1 1.62s
#75 Ring-2.6-1T medium Inclusionai 4.1 6.9 0/1 58.3s
#76 Kimi K2.5 medium Moonshot AI 6.5 6.8 0/1 69.7s
#77 Claude Sonnet 4.6 none Anthropic 6.1 6.8 0/1 2.56s
#78 Qwen3.6 27B medium Qwen 6.5 6.8 0/1 39.5s
#79 Hunter Alpha medium OpenRouter 7.0 6.7 0/1 6.44s
#80 Mimo V2 Omni medium Xiaomi 5.4 6.7 0/1 3.61s
#81 Mercury 2 medium Inception 4.8 6.6 0/1 821ms
#82 Hy3 preview high Tencent 3.0 6.6 0/1 0ms

Top Models by General Intelligence Score

General Intelligence Score vs Total Cost

Top Models by Response Time (avg)