AI BENCHY
❤️ Made by XCS


Data parsing and extraction Ranking

See which AI models perform best on data parsing and extraction, which ones stay reliable, and where the biggest gaps appear. The table below is sorted by average response time, ascending.

Models shown: 55
Average Data parsing and extraction score: 8.8

| Rank | Model | Reasoning | Company | Data parsing and extraction Score | Avg Score | Tests Correct | Response Time (avg) |
|---|---|---|---|---|---|---|---|
| #21 | MiMo-V2-Flash | medium | Xiaomi | 5.5 | 7.2 | 1/2 | 0ms |
| #38 | Gemini 2.5 Flash | none | Google | 9.9 | 5.2 | 2/2 | 652ms |
| #51 | Mercury 2 | none | Inception | 5.5 | 3.4 | 1/2 | 667ms |
| #55 | LFM2-24B-A2B | none | Liquid | 10.0 | 2.6 | 0/2 | 714ms |
| #53 | Grok 4.1 Fast | none | X AI | 9.9 | 2.9 | 2/2 | 943ms |
| #40 | Qwen3.5-122B-A10B | none | Qwen | 9.9 | 5.0 | 2/2 | 1.01s |
| #44 | GPT-5.4 | none | OpenAI | 9.9 | 4.5 | 2/2 | 1.04s |
| #36 | Mercury 2 | medium | Inception | 5.5 | 5.3 | 1/2 | 1.11s |
| #42 | Qwen3.5-35B-A3B | none | Qwen | 9.9 | 4.7 | 2/2 | 1.16s |
| #22 | Gemini 3.1 Flash Lite Preview | none | Google | 9.9 | 7.1 | 2/2 | 1.22s |
| #47 | GPT-4o-mini | none | OpenAI | 9.9 | 4.0 | 2/2 | 1.27s |
| #48 | Qwen3 Coder Next | none | Qwen | 5.4 | 4.0 | 1/2 | 1.32s |
| #20 | Gemini 3 Flash Preview | none | Google | 9.9 | 7.2 | 2/2 | 1.41s |
| #41 | Qwen3.5-27B | none | Qwen | 9.9 | 4.9 | 2/2 | 1.43s |
| #52 | GLM 4.7 Flash | medium | Z.ai | 5.0 | 3.1 | 1/2 | 1.51s |
| #37 | Qwen3.5-Flash | none | Qwen | 9.9 | 5.2 | 2/2 | 1.57s |
| #29 | Qwen3.5 Plus 2026-02-15 | none | Qwen | 9.9 | 6.2 | 2/2 | 1.89s |
| #39 | gpt-oss-120b | medium | OpenAI | 5.5 | 5.1 | 1/2 | 1.98s |
| #19 | GPT-5.3 Chat | none | OpenAI | 9.9 | 7.3 | 2/2 | 2.21s |
| #12 | Gemini 3.1 Flash Lite Preview | medium | Google | 9.9 | 7.5 | 2/2 | 2.29s |
| #17 | Gemini 3.1 Flash Lite Preview | low | Google | 9.9 | 7.3 | 2/2 | 3.00s |
| #15 | GPT-5.2 Chat | none | OpenAI | 9.9 | 7.4 | 2/2 | 3.05s |
| #3 | GPT-5.3-Codex | medium | OpenAI | 9.9 | 8.4 | 2/2 | 3.07s |
| #27 | GPT-5.2 | medium | OpenAI | 9.9 | 6.5 | 2/2 | 3.15s |
| #45 | Trinity Large Preview | none | Arcee AI | 9.9 | 4.2 | 2/2 | 3.26s |
| #25 | Claude Sonnet 4.6 | none | Anthropic | 9.9 | 6.8 | 2/2 | 3.43s |
| #16 | Gemini 2.5 Flash | medium | Google | 9.9 | 7.4 | 2/2 | 4.06s |
| #1 | Gemini 3 Flash Preview | medium | Google | 9.9 | 10.0 | 2/2 | 4.72s |
| #49 | GLM 4.7 Flash | none | Z.ai | 5.4 | 3.9 | 1/2 | 4.82s |
| #9 | GPT-5.4 | medium | OpenAI | 9.9 | 8.0 | 2/2 | 5.32s |
| #31 | GLM 5 | none | Z.ai | 9.9 | 6.0 | 2/2 | 5.78s |
| #30 | Grok 4.1 Fast | medium | X AI | 9.9 | 6.2 | 2/2 | 6.63s |
| #8 | Gemini 3.1 Flash Lite Preview | high | Google | 9.9 | 8.2 | 2/2 | 7.16s |
| #26 | Claude Opus 4.6 | medium | Anthropic | 9.9 | 6.6 | 2/2 | 7.37s |
| #43 | MiniMax M2.5 | medium | Minimax | 10.0 | 4.7 | 0/2 | 7.48s |
| #2 | Gemini 3.1 Pro Preview | medium | Google | 9.9 | 9.4 | 2/2 | 7.72s |
| #14 | GLM 5 | medium | Z.ai | 5.0 | 7.4 | 1/2 | 8.90s |
| #5 | Gemini 3 Flash Preview | low | Google | 9.9 | 8.2 | 2/2 | 9.40s |
| #33 | DeepSeek V3.2 | none | DeepSeek | 5.4 | 5.5 | 1/2 | 9.42s |
| #6 | Gemini 3 Pro Preview | medium | Google | 9.9 | 8.2 | 2/2 | 10.8s |
| #32 | GPT-5 Mini | medium | OpenAI | 9.9 | 6.0 | 2/2 | 12.6s |
| #11 | Claude Sonnet 4.6 | medium | Anthropic | 9.9 | 7.7 | 2/2 | 13.9s |
| #13 | Step 3.5 Flash | medium | Stepfun | 10.0 | 7.4 | 2/2 | 15.0s |
| #54 | MiMo-V2-Flash | none | Xiaomi | 10.0 | 2.9 | 0/2 | 19.7s |
| #34 | GPT-5 Nano | medium | OpenAI | 10.0 | 5.5 | 0/2 | 21.4s |
| #10 | Qwen3.5-122B-A10B | medium | Qwen | 9.9 | 7.7 | 2/2 | 23.4s |
| #23 | Seed-2.0-Mini | medium | Bytedance Seed | 9.9 | 6.9 | 2/2 | 24.3s |
| #7 | Qwen3.5-27B | medium | Qwen | 9.9 | 8.2 | 2/2 | 30.3s |
| #18 | DeepSeek V3.2 | medium | DeepSeek | 9.9 | 7.3 | 2/2 | 36.1s |
| #46 | Kimi K2.5 | none | Moonshot AI | 5.4 | 4.1 | 1/2 | 42.1s |
| #4 | Qwen3.5 Plus 2026-02-15 | medium | Qwen | 9.9 | 8.3 | 2/2 | 46.9s |
| #28 | Kimi K2.5 | medium | Moonshot AI | 9.9 | 6.4 | 2/2 | 49.8s |
| #24 | Qwen3.5-Flash | medium | Qwen | 5.5 | 6.9 | 1/2 | 57.0s |
| #35 | Qwen3.5-35B-A3B | medium | Qwen | 5.5 | 5.5 | 1/2 | 59.3s |
| #50 | Qwen3 Coder Next | medium | Qwen | 5.4 | 3.5 | 1/2 | 81.8s |
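The "Response Time (avg)" column mixes millisecond and second display units, so sorting it as text would be wrong. A minimal sketch of how the rows can be normalized and ordered, using three rows from the table above (the helper function is illustrative, not part of AI BENCHY):

```python
def parse_response_time(value: str) -> float:
    """Convert a display string like '652ms' or '1.01s' to seconds."""
    if value.endswith("ms"):
        return float(value[:-2]) / 1000.0
    if value.endswith("s"):
        return float(value[:-1])
    raise ValueError(f"unrecognized time format: {value!r}")

# Three rows from the leaderboard: (rank, model, avg response time).
rows = [
    (1, "Gemini 3 Flash Preview", "4.72s"),
    (21, "MiMo-V2-Flash", "0ms"),
    (38, "Gemini 2.5 Flash", "652ms"),
]

# Sort ascending by normalized response time, matching the table's order.
ordered = sorted(rows, key=lambda r: parse_response_time(r[2]))
print([model for _, model, _ in ordered])
# → ['MiMo-V2-Flash', 'Gemini 2.5 Flash', 'Gemini 3 Flash Preview']
```

The `ms` suffix must be checked before the bare `s` suffix, since every millisecond value also ends in `s`.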

[Charts: Top Models by Data parsing and extraction Score · Data parsing and extraction Score vs Total Cost · Top Models by Response Time (avg)]