AI BENCHY Category
Parsing and data extraction ranking
See which AI models perform best at parsing and data extraction, which remain reliable, and where the largest gaps appear.
| Rank | Model | Company | Parsing and data extraction score | Score | Tests passed | Response time (average) |
|---|---|---|---|---|---|---|
| #112 | GLM 5.1 none | Z.ai | 10.0 | 5.7 | 2/2 | 1.08s |
| #114 | Qwen3.5 Plus 2026-04-20 none | Qwen | 10.0 | 5.7 | 2/2 | 2.82s |
| #115 | Qwen3.5-27B none | Qwen | 10.0 | 5.7 | 2/2 | 1.43s |
| #116 | Hunter Alpha none | OpenRouter | 10.0 | 5.7 | 2/2 | 8.49s |
| #117 | Qwen3.5-35B-A3B none | Qwen | 10.0 | 5.6 | 2/2 | 1.16s |
| #120 | Mimo V2 PRO none | Xiaomi | 10.0 | 5.6 | 2/2 | 1.39s |
| #121 | Owl Alpha none | OpenRouter | 10.0 | 5.5 | 2/2 | 3.60s |
| #123 | MiMo-V2.5-Pro none | Xiaomi | 10.0 | 5.5 | 2/2 | 1.32s |
| #124 | Kimi K2.6 none | Moonshot AI | 10.0 | 5.5 | 2/2 | 1.32s |
| #125 | GPT-5.4 none | OpenAI | 10.0 | 5.5 | 2/2 | 1.04s |
| #127 | Grok 4.20 none | X AI | 10.0 | 5.4 | 2/2 | 0.52s |
| #128 | Qwen3.6 Flash none | Qwen | 10.0 | 5.4 | 2/2 | 2.13s |
| #131 | Qwen3.5-122B-A10B none | Qwen | 10.0 | 5.3 | 2/2 | 1.01s |
| #134 | GLM 5 Turbo none | Z.ai | 10.0 | 5.2 | 2/2 | 2.47s |
| #141 | Nemotron 3 Super none | NVIDIA | 10.0 | 4.9 | 2/2 | 7.92s |