AI BENCHY
Advertise here

AI BENCHY Category

General Intelligence Ranking

See which AI models perform best on General Intelligence, which ones stay reliable, and where the biggest gaps appear.

Models Shown

15

Average General Intelligence Score

5.9

Rank Model Company General Intelligence Score Score Tests Correct Response Time (avg)
#159 Ling-2.6-1T none Inclusionai 5.0 4.3 0/1 20.3s
#36 Qwen3.5 Plus 2026-04-20 medium Qwen 4.9 7.6 0/1 25.3s
#39 Qwen3.6 Flash medium Qwen 4.8 7.5 0/1 9.88s
#28 Gemini 2.5 Flash medium Google 4.8 7.8 0/1 4.86s
#81 Mercury 2 medium Inception 4.8 6.6 0/1 821ms
#114 Qwen3.5 Plus 2026-04-20 none Qwen 4.8 5.7 0/1 1.41s
#126 gpt-oss-120b none OpenAI 4.8 5.4 0/1 10.8s
#127 Grok 4.20 none X AI 4.8 5.4 0/1 659ms
#132 Mistral Small 4 medium Mistral 4.8 5.3 0/1 2.05s
#144 GPT-5.4 Mini none OpenAI 4.8 4.9 0/1 1.82s
#155 Mercury 2 none Inception 4.8 4.5 0/1 628ms
#21 GPT-5.4 medium OpenAI 4.7 8.0 0/1 4.92s
#25 Qwen3.5 Plus 2026-02-15 medium Qwen 4.7 7.9 0/1 79.9s
#133 DeepSeek V3.2 none DeepSeek 4.7 5.2 0/1 9.32s
#15 GPT-5.3-Codex medium OpenAI 4.6 8.4 0/1 4.87s

Top Models by General Intelligence Score

General Intelligence Score vs Total Cost

Top Models by Response Time (avg)