AI BENCHY
Your ad here

AI BENCHY Category

General Intelligence Ranking

See which AI models perform best on General Intelligence, which ones stay reliable, and where the biggest gaps appear.

Models Shown

15

Average General Intelligence Score

6.1

Rank Model Company General Intelligence Score Score Tests Correct Response Time (avg)
#62 Gemini 2.5 Flash none Google 5.0 6.2 0/1 615ms
#67 Qwen3.5-27B none Qwen 5.0 5.9 0/1 2.51s
#70 Qwen3.5-122B-A10B none Qwen 5.0 5.7 0/1 1.12s
#75 GLM 5.1 none Z.ai 5.0 5.6 0/1 790ms
#79 Grok 4.20 Beta none X AI 5.0 5.3 0/1 541ms
#15 Gemini 2.5 Flash medium Google 4.8 8.2 0/1 4.86s
#54 Mercury 2 medium Inception 4.8 6.5 0/1 821ms
#73 Mistral Small 4 medium Mistral 4.8 5.7 0/1 2.05s
#82 Grok 4.20 none X AI 4.8 5.2 0/1 659ms
#86 GPT-5.4 Mini none OpenAI 4.8 5.1 0/1 1.82s
#91 Mercury 2 none Inception 4.8 4.8 0/1 628ms
#16 GPT-5.4 medium OpenAI 4.7 8.2 0/1 4.92s
#8 Qwen3.5 Plus 2026-02-15 medium Qwen 4.7 8.5 0/1 79.9s
#7 GPT-5.3-Codex medium OpenAI 4.6 8.6 0/1 4.87s
#36 GPT-5.3 Chat none OpenAI 4.6 7.7 0/1 1.99s

Top Models by General Intelligence Score

General Intelligence Score vs Total Cost

Top Models by Response Time (avg)