AI BENCHY
Your ad here

AI BENCHY Category

General Intelligence Ranking

See which AI models perform best on General Intelligence, which ones stay reliable, and where the biggest gaps appear. Sort by: Tests Correct ↓.

Models Shown

15

Average General Intelligence Score

6.1

Rank Model Company General Intelligence Score Score Tests Correct Response Time (avg)
#56 Grok 4.20 Multi Agent Beta medium X AI 5.8 6.4 0/1 6.40s
#57 GPT-5 Nano medium OpenAI 4.1 6.3 0/1 17.5s
#58 GLM 5V Turbo none Z.ai 4.6 6.2 0/1 2.22s
#60 Gemma 4 26B A4B none Google 4.0 6.2 0/1 3.54s
#62 Gemini 2.5 Flash none Google 5.0 6.2 0/1 615ms
#63 Qwen3.5-35B-A3B none Qwen 6.5 6.1 0/1 1.19s
#65 MiMo-V2-Pro none Xiaomi 4.3 6.0 0/1 2.44s
#66 GPT-5.4 none OpenAI 4.4 5.9 0/1 1.78s
#67 Qwen3.5-27B none Qwen 5.0 5.9 0/1 2.51s
#68 gpt-oss-120b medium OpenAI 4.3 5.8 0/1 7.90s
#69 Kimi K2.6 none Moonshot AI 5.4 5.8 0/1 1.55s
#70 Qwen3.5-122B-A10B none Qwen 5.0 5.7 0/1 1.12s
#71 MiniMax M2.5 medium Minimax 3.8 5.7 0/1 6.63s
#72 Hunter Alpha none OpenRouter 6.1 5.7 0/1 2.71s
#73 Mistral Small 4 medium Mistral 4.8 5.7 0/1 2.05s

Top Models by General Intelligence Score

General Intelligence Score vs Total Cost

Top Models by Response Time (avg)