AI BENCHY
Advertise here

AI BENCHY Category

General Intelligence Ranking

See which AI models perform best on General Intelligence, which ones stay reliable, and where the biggest gaps appear.

Models Shown

15

Average General Intelligence Score

5.9

Rank Model Company General Intelligence Score Score Tests Correct Response Time (avg)
#123 MiMo-V2.5-Pro none Xiaomi 4.0 5.5 0/1 2.58s
#137 Elephant Alpha none Openrouter 4.0 5.1 0/1 854ms
#138 Ling-2.6-flash none Inclusionai 4.0 5.0 0/1 1.45s
#142 Mistral Small 4 none Mistral 4.0 4.9 0/1 729ms
#147 GPT-4o-mini none OpenAI 4.0 4.8 0/1 909ms
#160 LFM2-24B-A2B none Liquid 4.0 4.2 0/1 395ms
#163 Granite 4.1 8B none IBM Granite 4.0 4.0 0/1 499ms
#65 Grok 4.20 medium X AI 3.9 7.1 0/1 24.5s
#130 MiniMax M2.7 medium Minimax 3.9 5.3 0/1 38.7s
#129 MiniMax M2.5 medium Minimax 3.8 5.3 0/1 6.63s
#148 GPT-5.4 Nano none OpenAI 3.8 4.7 0/1 1.31s
#42 GPT-5.2 medium OpenAI 3.7 7.5 0/1 4.32s
#41 Nemotron 3 Ultra 550b A55b medium NVIDIA 3.7 7.5 0/1 2.52s
#158 GLM 4.7 Flash medium Z.ai 3.6 4.4 0/1 18.1s
#57 Step 3.7 Flash low Stepfun 3.4 7.3 0/1 7.00s

Top Models by General Intelligence Score

General Intelligence Score vs Total Cost

Top Models by Response Time (avg)