AI BENCHY Category
Parsing and Data Extraction Rankings
See which AI models perform best at parsing and data extraction, which stay reliable, and where the biggest gaps appear. Sorted by: Tests passed ↓.
Models shown
15
Average parsing and data extraction score
8.7
Best model
Gemini 3 Flash Preview 10.0

| Rank | Model | Company | Parsing and data extraction score | Score | Tests passed | Response time (avg) |
|---|---|---|---|---|---|---|
| #95 | Qwen3.5 Plus 2026-02-15 none | Qwen | 10.0 | 6.3 | 2/2 | 1.89s |
| #97 | Gemini 2.5 Flash none | | 10.0 | 6.2 | 2/2 | 652ms |
| #98 | GLM 5 none | Z.ai | 10.0 | 6.1 | 2/2 | 5.78s |
| #101 | Mimo V2 Omni none | Xiaomi | 10.0 | 6.0 | 2/2 | 1.76s |
| #102 | Gemma 4 26B A4B none | | 10.0 | 6.0 | 2/2 | 1.70s |
| #104 | Nemotron 3 Ultra 550b A55b none | NVIDIA | 10.0 | 6.0 | 2/2 | 1.94s |
| #105 | Nemotron 3 Super medium | NVIDIA | 10.0 | 5.8 | 2/2 | 18.2s |
| #106 | Grok 4.20 Beta none | X AI | 10.0 | 5.8 | 2/2 | 601ms |
| #108 | Qwen3.5-Flash none | Qwen | 10.0 | 5.8 | 2/2 | 1.57s |
| #109 | GLM 5V Turbo none | Z.ai | 10.0 | 5.8 | 2/2 | 3.81s |
| #110 | Seed-2.0-Lite none | Bytedance Seed | 10.0 | 5.8 | 2/2 | 1.82s |
| #111 | Owl Alpha medium | Openrouter | 10.0 | 5.7 | 2/2 | 21.6s |
| #112 | GLM 5.1 none | Z.ai | 10.0 | 5.7 | 2/2 | 1.08s |
| #114 | Qwen3.5 Plus 2026-04-20 none | Qwen | 10.0 | 5.7 | 2/2 | 2.82s |
| #115 | Qwen3.5-27B none | Qwen | 10.0 | 5.7 | 2/2 | 1.43s |