AI BENCHY
Advertise here

AI BENCHY Category

Domain specific Ranking

See which AI models perform best on Domain specific, which ones stay reliable, and where the biggest gaps appear. Sort by: Response Time (avg) ↓.

Models Shown

15

Average Domain specific Score

4.8

Rank Model Company Domain specific Score Score Tests Correct Response Time (avg)
#50 Gemini 3.1 Flash Lite Preview low Google 5.3 7.4 1/3 2.36s
#116 Hunter Alpha none OpenRouter 5.3 5.7 1/3 2.33s
#98 GLM 5 none Z.ai 3.0 6.1 0/3 2.24s
#101 Mimo V2 Omni none Xiaomi 5.3 6.0 1/3 2.10s
#109 GLM 5V Turbo none Z.ai 5.3 5.8 1/3 2.09s
#112 GLM 5.1 none Z.ai 2.9 5.7 0/3 1.99s
#134 GLM 5 Turbo none Z.ai 5.3 5.2 1/3 1.97s
#120 Mimo V2 PRO none Xiaomi 5.3 5.6 1/3 1.78s
#68 Claude Opus 4.8 none Anthropic 5.3 7.0 1/3 1.66s
#61 Gemini 3.1 Flash Lite low Google 5.3 7.2 1/3 1.52s
#124 Kimi K2.6 none Moonshot AI 5.3 5.5 1/3 1.48s
#110 Seed-2.0-Lite none Bytedance Seed 3.6 5.8 0/3 1.33s
#91 GPT-5.5 none OpenAI 2.9 6.4 0/3 1.31s
#74 Qwen3.6 Max Preview none Qwen 7.7 6.9 2/3 1.22s
#8 Claude Opus 4.7 none Anthropic 7.7 8.9 2/3 1.19s

Top Models by Domain specific Score

Domain specific Score vs Total Cost

Top Models by Response Time (avg)