AI BENCHY

Combined Ranking

See which AI models perform best in the Combined category, which ones stay reliable, and where the biggest gaps appear.

Models Shown: 15

Average Combined Score: 6.2

| Rank | Model | Company | Combined Score | Score | Tests Correct | Response Time (avg) |
| --- | --- | --- | --- | --- | --- | --- |
| #21 | Gemini 3 Flash Preview none | Google | 4.7 | 8.1 | 0/1 | 3.56s |
| #23 | MiMo-V2-Pro medium | Xiaomi | 4.7 | 8.1 | 0/1 | 64.7s |
| #43 | Qwen3.5-35B-A3B medium | Qwen | 4.7 | 7.4 | 0/1 | 75.3s |
| #50 | Hunter Alpha medium | OpenRouter | 4.7 | 6.7 | 0/1 | 30.5s |
| #80 | MiniMax M2.7 medium | Minimax | 4.7 | 5.3 | 0/1 | 41.0s |
| #71 | MiniMax M2.5 medium | Minimax | 4.5 | 5.7 | 0/1 | 60.4s |
| #5 | Gemini 3 Flash Preview low | Google | 3.0 | 8.8 | 0/1 | 3.27s |
| #12 | Gemini 3 PRO Preview medium | Google | 3.0 | 8.4 | 0/1 | 10.4s |
| #14 | Gemma 4 31B medium | Google | 3.0 | 8.3 | 0/1 | 0ms |
| #22 | Gemini 3.1 Flash Lite Preview low | Google | 3.0 | 8.1 | 0/1 | 11.9s |
| #29 | Gemini 3.1 Flash Lite Preview none | Google | 3.0 | 7.9 | 0/1 | 3.20s |
| #48 | Gemma 4 31B none | Google | 3.0 | 6.9 | 0/1 | 0ms |
| #49 | Qwen3.5 Plus 2026-02-15 none | Qwen | 3.0 | 6.8 | 0/1 | 6.65s |
| #53 | GLM 5 none | Z.ai | 3.0 | 6.6 | 0/1 | 4.98s |
| #55 | MiMo-V2-Omni none | Xiaomi | 3.0 | 6.5 | 0/1 | 2.47s |

[Charts: Top Models by Combined Score; Combined Score vs Total Cost; Top Models by Response Time (avg)]