Data parsing and extraction Model Ranking

See which AI models perform best on Data parsing and extraction, which ones stay reliable, and where the biggest gaps appear. Sort by: Metric ↑.

Models Shown

Average Data parsing and extraction Score

8.9

Best Model

Step 3.5 Flash 1.5

Failure Reasons

With failure reason Wrong answer41 With failure reason API error14 With failure reason No answer8 With failure reason Extra formatting6 With failure reason Timed out1

216/216

Rank	Model	Company	Data parsing and extraction Score	Score	Total Cost	Tests Correct	Response Time (avg)
#34	GPT-5.2 Chat none	OpenAI	10.0	8.0	$0.604	2/2	3.05s
Total Tests 2 Wrong Tests 0 Total Cost $0.604 Response Time (avg) 3.05s
#35	GLM 5.2 high	Z.ai	10.0	8.0	$0.817	2/2	5.81s
Total Tests 2 Wrong Tests 0 Total Cost $0.817 Response Time (avg) 5.81s
#36	Inkling medium	Thinkingmachines	10.0	8.0	$0.391	2/2	3.60s
Total Tests 2 Wrong Tests 0 Total Cost $0.391 Response Time (avg) 3.60s
#38	GPT-5.6 Terra high	OpenAI	10.0	8.0	$1.055	2/2	938ms
Total Tests 2 Wrong Tests 0 Total Cost $1.055 Response Time (avg) 938ms
#39	Seed-2.0-Lite medium	Bytedance Seed	10.0	7.9	$0.234	2/2	9.07s
Total Tests 2 Wrong Tests 0 Total Cost $0.234 Response Time (avg) 9.07s
#40	Qwen3.7 Plus medium	Qwen	10.0	7.9	$0.267	2/2	21.7s
Total Tests 2 Wrong Tests 0 Total Cost $0.267 Response Time (avg) 21.7s
#41	Qwen3.6 Plus medium	Qwen	10.0	7.8	$0.405	2/2	14.9s
Total Tests 2 Wrong Tests 0 Total Cost $0.405 Response Time (avg) 14.9s
#42	GLM 5.2 medium	Z.ai	10.0	7.8	$0.187	2/2	13.4s
Total Tests 2 Wrong Tests 0 Total Cost $0.187 Response Time (avg) 13.4s
#43	GPT-5.6 Terra medium	OpenAI	10.0	7.8	$0.676	2/2	872ms
Total Tests 2 Wrong Tests 0 Total Cost $0.676 Response Time (avg) 872ms
#44	Claude Sonnet 4.6 medium	Anthropic	10.0	7.8	$2.057	2/2	13.9s
Total Tests 2 Wrong Tests 0 Total Cost $2.057 Response Time (avg) 13.9s
#47	Claude Opus 4.6 medium	Anthropic	10.0	7.7	$3.059	2/2	7.37s
Total Tests 2 Wrong Tests 0 Total Cost $3.059 Response Time (avg) 7.37s
#48	GPT-5.6 Luna high	OpenAI	10.0	7.7	$1.017	2/2	2.18s
Total Tests 2 Wrong Tests 0 Total Cost $1.017 Response Time (avg) 2.18s
#50	DeepSeek V4 Pro high	DeepSeek	10.0	7.7	$0.200	2/2	25.0s
Total Tests 2 Wrong Tests 0 Total Cost $0.200 Response Time (avg) 25.0s
#51	MiniMax M3 medium	Minimax	10.0	7.6	$0.286	2/2	14.9s
Total Tests 2 Wrong Tests 0 Total Cost $0.286 Response Time (avg) 14.9s
#52	Grok Build 0.1 medium	X AI	10.0	7.6	$1.097	2/2	10.7s
Total Tests 2 Wrong Tests 0 Total Cost $1.097 Response Time (avg) 10.7s

Data parsing and extraction Ranking

Filter models

Top Models by Data parsing and extraction Score

Data parsing and extraction Score vs Total Cost

Top Models by Response Time (avg)