Kategori AI BENCHY
Peringkat Parsing dan ekstraksi data
Lihat model AI mana yang paling baik di Parsing dan ekstraksi data, mana yang tetap andal, dan di mana kesenjangan terbesar muncul.
Model yang ditampilkan
15
Rata-rata Skor Parsing dan ekstraksi data
9.0
Model terbaik
Step 3.5 Flash 10.0| Peringkat | Model | Perusahaan | Skor Parsing dan ekstraksi data | Skor | Tes benar | Waktu respons (rata-rata) |
|---|---|---|---|---|---|---|
| #54 | Mercury 2 medium | Inception | 7.3 | 6.5 | 1/2 | 1.11s |
| #73 | Mistral Small 4 medium | Mistral | 7.3 | 5.7 | 1/2 | 1.23s |
| #91 | Mercury 2 none | Inception | 7.3 | 4.8 | 1/2 | 667ms |
| #23 | MiMo-V2-Pro medium | Xiaomi | 7.3 | 8.1 | 1/2 | 17.2s |
| #74 | GLM 4.7 Flash none | Z.ai | 7.3 | 5.6 | 1/2 | 4.82s |
| #76 | Kimi K2.5 none | Moonshot AI | 7.3 | 5.5 | 1/2 | 42.1s |
| #13 | GLM 5 medium | Z.ai | 7.1 | 8.4 | 1/2 | 8.90s |
| #41 | MiMo-V2-Flash medium | Xiaomi | 6.5 | 7.5 | 1/2 | 0ms |
| #84 | gpt-oss-120b none | OpenAI | 6.5 | 5.2 | 1/2 | 7.12s |
| #81 | Elephant medium | Openrouter | 6.5 | 5.2 | 1/2 | 979ms |
| #85 | Elephant none | Openrouter | 6.5 | 5.2 | 1/2 | 1.04s |
| #87 | Qwen3 Coder Next none | Qwen | 6.5 | 5.1 | 1/2 | 1.32s |
| #92 | Qwen3 Coder Next medium | Qwen | 6.5 | 4.7 | 1/2 | 81.8s |
| #96 | GPT-5.4 Nano none | OpenAI | 6.5 | 4.5 | 1/2 | 1.11s |
| #68 | gpt-oss-120b medium | OpenAI | 6.4 | 5.8 | 1/2 | 1.98s |