Kategori AI BENCHY
Peringkat Parsing dan ekstraksi data
Lihat model AI mana yang paling baik di Parsing dan ekstraksi data, mana yang tetap andal, dan di mana kesenjangan terbesar muncul. Urutkan berdasarkan: Tes benar ↑.
Model yang ditampilkan
15
Rata-rata Skor Parsing dan ekstraksi data
9.0
Model terbaik
GPT-5 Nano 3.7| Peringkat | Model | Perusahaan | Skor Parsing dan ekstraksi data | Skor | Tes benar | Waktu respons (rata-rata) |
|---|---|---|---|---|---|---|
| #39 | Seed-2.0-Mini medium | Bytedance Seed | 10.0 | 7.5 | 2/2 | 24.3s |
| #40 | GPT-5.2 medium | OpenAI | 10.0 | 7.5 | 2/2 | 3.15s |
| #42 | Claude Sonnet 4.6 none | Anthropic | 10.0 | 7.4 | 2/2 | 3.43s |
| #44 | GPT-5.4 Mini medium | OpenAI | 10.0 | 7.3 | 2/2 | 2.43s |
| #45 | GPT-5 Mini medium | OpenAI | 10.0 | 7.0 | 2/2 | 12.6s |
| #46 | Kimi K2.5 medium | Moonshot AI | 10.0 | 7.0 | 2/2 | 49.8s |
| #47 | Grok 4.20 medium | X AI | 10.0 | 7.0 | 2/2 | 4.17s |
| #48 | Gemma 4 31B none | 10.0 | 6.9 | 2/2 | 2.25s | |
| #49 | Qwen3.5 Plus 2026-02-15 none | Qwen | 10.0 | 6.8 | 2/2 | 1.89s |
| #50 | Hunter Alpha medium | OpenRouter | 10.0 | 6.7 | 2/2 | 23.2s |
| #51 | Nemotron 3 Super medium | NVIDIA | 10.0 | 6.7 | 2/2 | 18.2s |
| #52 | Grok 4.1 Fast medium | X AI | 10.0 | 6.7 | 2/2 | 6.63s |
| #53 | GLM 5 none | Z.ai | 10.0 | 6.6 | 2/2 | 5.78s |
| #55 | MiMo-V2-Omni none | Xiaomi | 10.0 | 6.5 | 2/2 | 1.69s |
| #56 | Grok 4.20 Multi Agent Beta medium | X AI | 10.0 | 6.4 | 2/2 | 5.54s |