AI BENCHY
Advertise here

AI BENCHY Category

Domain specific Ranking

See which AI models perform best on Domain specific, which ones stay reliable, and where the biggest gaps appear. Sort by: Response Time (avg) ↓.

Models Shown

15

Average Domain specific Score

4.8

Rank Model Company Domain specific Score Score Tests Correct Response Time (avg)
#78 Qwen3.6 27B medium Qwen 2.9 6.8 0/3 73.4s
#23 GLM 5 Turbo medium Z.ai 2.9 8.0 0/3 71.1s
#45 GPT-5.4 Mini medium OpenAI 4.1 7.5 0/3 65.3s
#75 Ring-2.6-1T medium Inclusionai 3.5 6.9 0/3 64.9s
#15 GPT-5.3-Codex medium OpenAI 5.9 8.4 1/3 64.3s
#29 Qwen3.5-122B-A10B medium Qwen 2.9 7.8 0/3 63.4s
#149 Nemotron 3 Nano Omni 30b A3b Reasoning medium NVIDIA 2.9 4.6 0/3 56.7s
#36 Qwen3.5 Plus 2026-04-20 medium Qwen 2.9 7.6 0/3 53.1s
#99 gpt-oss-120b medium OpenAI 2.9 6.1 0/3 50.9s
#22 Step 3.7 Flash medium Stepfun 7.7 8.0 2/3 48.3s
#80 Mimo V2 Omni medium Xiaomi 3.0 6.7 0/3 47.9s
#18 Qwen3.7 Plus medium Qwen 3.6 8.2 0/3 45.3s
#54 GPT-5 Mini medium OpenAI 3.6 7.3 0/3 44.6s
#57 Step 3.7 Flash low Stepfun 5.3 7.3 1/3 43.3s
#89 Hy3 preview low Tencent 5.9 6.4 1/3 40.4s

Top Models by Domain specific Score

Domain specific Score vs Total Cost

Top Models by Response Time (avg)