AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Category

Data parsing and extraction Ranking

See which AI models perform best on Data parsing and extraction, which ones stay reliable, and where the biggest gaps appear. Sort by: Response Time (avg) ↑.

Models Shown

15

Average Data parsing and extraction Score

8.7

Rank Model Company Data parsing and extraction Score Score Tests Correct Response Time (avg)
#84 Grok 4.20 Multi Agent Beta medium X AI 10.0 6.6 2/2 5.54s
#41 Nemotron 3 Ultra 550b A55b medium NVIDIA 10.0 7.5 2/2 5.68s
#98 GLM 5 none Z.ai 10.0 6.1 2/2 5.78s
#89 Hy3 preview low Tencent 6.5 6.4 1/2 5.85s
#23 GLM 5 Turbo medium Z.ai 10.0 8.0 2/2 6.19s
#56 MiMo-V2.5 medium Xiaomi 2.7 7.3 0/2 6.33s
#2 Gemini 3.5 Flash high Google 10.0 9.6 2/2 6.43s
#86 Grok 4.1 Fast medium X AI 10.0 6.5 2/2 6.63s
#126 gpt-oss-120b none OpenAI 6.5 5.4 1/2 7.12s
#12 Gemini 3.1 Flash Lite Preview high Google 10.0 8.6 2/2 7.16s
#69 Claude Opus 4.6 medium Anthropic 10.0 7.0 2/2 7.37s
#129 MiniMax M2.5 medium Minimax 4.6 5.3 0/2 7.48s
#4 Gemini 3.1 Pro Preview medium Google 10.0 9.4 2/2 7.72s
#141 Nemotron 3 Super none NVIDIA 10.0 4.9 2/2 7.92s
#20 Gemini 3.5 Flash none Google 6.5 8.1 1/2 8.10s

Top Models by Data parsing and extraction Score

Data parsing and extraction Score vs Total Cost

Top Models by Response Time (avg)