AI BENCHY

Data parsing and extraction Ranking

See which AI models perform best on data parsing and extraction, which ones stay reliable, and where the biggest gaps appear. Models are sorted by Tests Correct, descending.

Models Shown: 15

Average Data parsing and extraction Score: 9.0

Rank  Model                   Company      Data parsing and extraction Score  Score  Tests Correct  Response Time (avg)
#32   Qwen3.5-Flash medium    Qwen         7.3                                7.8    1/2            57.0s
#41   MiMo-V2-Flash medium    Xiaomi       6.5                                7.5    1/2            0ms
#43   Qwen3.5-35B-A3B medium  Qwen         7.3                                7.4    1/2            59.3s
#54   Mercury 2 medium        Inception    7.3                                6.5    1/2            1.11s
#64   DeepSeek V3.2 none      DeepSeek     6.3                                6.1    1/2            9.42s
#68   gpt-oss-120b medium     OpenAI       6.4                                5.8    1/2            1.98s
#73   Mistral Small 4 medium  Mistral      7.3                                5.7    1/2            1.23s
#74   GLM 4.7 Flash none      Z.ai         7.3                                5.6    1/2            4.82s
#76   Kimi K2.5 none          Moonshot AI  7.3                                5.5    1/2            42.1s
#80   MiniMax M2.7 medium     Minimax      6.3                                5.3    1/2            21.9s
#81   Elephant medium         Openrouter   6.5                                5.2    1/2            979ms
#84   gpt-oss-120b none       OpenAI       6.5                                5.2    1/2            7.12s
#85   Elephant none           Openrouter   6.5                                5.2    1/2            1.04s
#87   Qwen3 Coder Next none   Qwen         6.5                                5.1    1/2            1.32s
#91   Mercury 2 none          Inception    7.3                                4.8    1/2            667ms

[Charts: Top Models by Data parsing and extraction Score; Data parsing and extraction Score vs Total Cost; Top Models by Response Time (avg)]