Kategori AI BENCHY
Peringkat Parsing dan ekstraksi data
Lihat model AI mana yang paling baik di Parsing dan ekstraksi data, mana yang tetap andal, dan di mana kesenjangan terbesar muncul.
Model yang ditampilkan
15
Rata-rata Skor Parsing dan ekstraksi data
8.7
Model terbaik
DeepSeek V4 Flash 10.0| Peringkat | Model | Perusahaan | Skor Parsing dan ekstraksi data | Skor | Tes benar | Waktu respons (rata-rata) |
|---|---|---|---|---|---|---|
| #14 | Qwen3.6 Max Preview medium | Qwen | 10.0 | 8.5 | 2/2 | 41.2s |
| #15 | GPT-5.3-Codex medium | OpenAI | 10.0 | 8.4 | 2/2 | 3.07s |
| #16 | Gemini 3 Flash Preview low | 10.0 | 8.4 | 2/2 | 9.40s | |
| #18 | Qwen3.7 Plus medium | Qwen | 10.0 | 8.2 | 2/2 | 21.7s |
| #19 | Seed-2.0-Lite medium | Bytedance Seed | 10.0 | 8.2 | 2/2 | 9.07s |
| #21 | GPT-5.4 medium | OpenAI | 10.0 | 8.0 | 2/2 | 5.32s |
| #22 | Step 3.7 Flash medium | Stepfun | 10.0 | 8.0 | 2/2 | 2.75s |
| #23 | GLM 5 Turbo medium | Z.ai | 10.0 | 8.0 | 2/2 | 6.19s |
| #24 | GPT-5.2 Chat none | OpenAI | 10.0 | 7.9 | 2/2 | 3.05s |
| #25 | Qwen3.5 Plus 2026-02-15 medium | Qwen | 10.0 | 7.9 | 2/2 | 46.9s |
| #26 | Qwen3.6 Plus medium | Qwen | 10.0 | 7.9 | 2/2 | 14.9s |
| #27 | Gemma 4 31B medium | 10.0 | 7.8 | 2/2 | 21.1s | |
| #28 | Gemini 2.5 Flash medium | 10.0 | 7.8 | 2/2 | 4.06s | |
| #29 | Qwen3.5-122B-A10B medium | Qwen | 10.0 | 7.8 | 2/2 | 23.4s |
| #30 | Qwen3.5-27B medium | Qwen | 10.0 | 7.8 | 2/2 | 30.3s |