Kategori AI BENCHY
Peringkat Parsing dan ekstraksi data
Lihat model AI mana yang paling baik di Parsing dan ekstraksi data, mana yang tetap andal, dan di mana kesenjangan terbesar muncul. Urutkan berdasarkan: Metrik ↑.
Model yang ditampilkan
15
Rata-rata Skor Parsing dan ekstraksi data
9.0
Model terbaik
MiMo-V2-Flash 2.9| Peringkat | Model | Perusahaan | Skor Parsing dan ekstraksi data | Skor | Tes benar | Waktu respons (rata-rata) |
|---|---|---|---|---|---|---|
| #22 | Gemini 3.1 Flash Lite Preview low | 10.0 | 8.1 | 2/2 | 3.00s | |
| #24 | Gemma 4 26B A4B medium | 10.0 | 8.0 | 2/2 | 16.5s | |
| #25 | Grok 4.20 Beta medium | X AI | 10.0 | 8.0 | 2/2 | 4.01s |
| #26 | Claude Sonnet 4.6 medium | Anthropic | 10.0 | 8.0 | 2/2 | 13.9s |
| #27 | DeepSeek V3.2 medium | DeepSeek | 10.0 | 8.0 | 2/2 | 36.1s |
| #28 | GPT-5.2 Chat none | OpenAI | 10.0 | 7.9 | 2/2 | 3.05s |
| #29 | Gemini 3.1 Flash Lite Preview none | 10.0 | 7.9 | 2/2 | 1.22s | |
| #31 | GLM 5V Turbo medium | Z.ai | 10.0 | 7.8 | 2/2 | 9.60s |
| #33 | GLM 5.1 medium | Z.ai | 10.0 | 7.8 | 2/2 | 9.33s |
| #34 | Kimi K2.6 medium | Moonshot AI | 10.0 | 7.7 | 2/2 | 20.4s |
| #35 | MiMo-V2-Omni medium | Xiaomi | 10.0 | 7.7 | 2/2 | 2.29s |
| #36 | GPT-5.3 Chat none | OpenAI | 10.0 | 7.7 | 2/2 | 2.21s |
| #37 | Claude Opus 4.6 medium | Anthropic | 10.0 | 7.6 | 2/2 | 7.37s |
| #38 | GPT-5.4 Nano medium | OpenAI | 10.0 | 7.6 | 2/2 | 2.54s |
| #39 | Seed-2.0-Mini medium | Bytedance Seed | 10.0 | 7.5 | 2/2 | 24.3s |