AI BENCHY

Domain specific Ranking

See which AI models perform best in the Domain specific category, which ones stay reliable, and where the biggest gaps appear. Models are sorted by average response time, descending.

Models Shown: 8
Average Domain specific Score: 4.8

Rank  Model                     Company         Domain specific Score  Score  Tests Correct  Response Time (avg)
#63   Qwen3.5-35B-A3B none      Qwen            7.7                    6.1    2/3            485ms
#70   Qwen3.5-122B-A10B none    Qwen            5.3                    5.7    1/3            465ms
#90   Qwen3.5-9B none           Qwen            3.0                    4.8    0/3            464ms
#83   Mistral Small 4 none      Mistral         5.3                    5.2    1/3            367ms
#98   LFM2-24B-A2B none         Liquid          5.9                    4.1    1/3            287ms
#13   GLM 5 medium              Z.ai            3.5                    8.4    0/3            0ms
#26   Claude Sonnet 4.6 medium  Anthropic       2.9                    8.0    0/3            0ms
#39   Seed-2.0-Mini medium      Bytedance Seed  3.0                    7.5    0/3            0ms

Charts: Top Models by Domain specific Score; Domain specific Score vs Total Cost; Top Models by Response Time (avg)