AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Category

Data parsing and extraction Ranking

See which AI models perform best on Data parsing and extraction, which ones stay reliable, and where the biggest gaps appear. Sort by: Response Time (avg) ↓.

Models Shown

15

Average Data parsing and extraction Score

9.0

Best Model

Qwen3.5-9B 3.6
Rank Model Company Data parsing and extraction Score Score Tests Correct Response Time (avg)
#17 Gemini 3.1 Flash Lite Preview medium Google 10.0 8.2 2/2 2.29s
#35 MiMo-V2-Omni medium Xiaomi 10.0 7.7 2/2 2.29s
#48 Gemma 4 31B none Google 10.0 6.9 2/2 2.25s
#36 GPT-5.3 Chat none OpenAI 10.0 7.7 2/2 2.21s
#4 Claude Opus 4.7 none Anthropic 10.0 9.2 2/2 2.15s
#68 gpt-oss-120b medium OpenAI 6.4 5.8 1/2 1.98s
#49 Qwen3.5 Plus 2026-02-15 none Qwen 10.0 6.8 2/2 1.89s
#61 Seed-2.0-Lite none Bytedance Seed 10.0 6.2 2/2 1.82s
#60 Gemma 4 26B A4B none Google 10.0 6.2 2/2 1.70s
#55 MiMo-V2-Omni none Xiaomi 10.0 6.5 2/2 1.69s
#59 Qwen3.5-Flash none Qwen 10.0 6.2 2/2 1.57s
#93 GLM 4.7 Flash medium Z.ai 6.3 4.6 1/2 1.51s
#67 Qwen3.5-27B none Qwen 10.0 5.9 2/2 1.43s
#21 Gemini 3 Flash Preview none Google 10.0 8.1 2/2 1.41s
#65 MiMo-V2-Pro none Xiaomi 10.0 6.0 2/2 1.39s

Top Models by Data parsing and extraction Score

Data parsing and extraction Score vs Total Cost

Top Models by Response Time (avg)