AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Category

Data parsing and extraction Ranking

See which AI models perform best on Data parsing and extraction, which ones stay reliable, and where the biggest gaps appear. Sort by: Response Time (avg) ↓.

Models Shown

13

Average Data parsing and extraction Score

8.7

Best Model

Qwen3.5-9B 3.6
Rank Model Company Data parsing and extraction Score Score Tests Correct Response Time (avg)
#157 Grok 4.1 Fast none X AI 10.0 4.4 2/2 943ms
#154 Qwen3.5-9B none Qwen 10.0 4.6 2/2 847ms
#90 Gemini 3.1 Flash Lite none Google 10.0 6.4 2/2 843ms
#142 Mistral Small 4 none Mistral 10.0 4.9 2/2 822ms
#160 LFM2-24B-A2B none Liquid 3.0 4.2 0/2 714ms
#155 Mercury 2 none Inception 7.3 4.5 1/2 667ms
#97 Gemini 2.5 Flash none Google 10.0 6.2 2/2 652ms
#146 Laguna Xs.2 none Poolside 10.0 4.8 2/2 646ms
#106 Grok 4.20 Beta none X AI 10.0 5.8 2/2 601ms
#163 Granite 4.1 8B none IBM Granite 3.0 4.0 0/2 575ms
#127 Grok 4.20 none X AI 10.0 5.4 2/2 522ms
#64 MiMo-V2-Flash medium Xiaomi 6.5 7.2 1/2 0ms
#83 Step 3.5 Flash none Stepfun 3.0 6.6 0/1 0ms

Top Models by Data parsing and extraction Score

Data parsing and extraction Score vs Total Cost

Top Models by Response Time (avg)