AI BENCHY Category
Parsing and data extraction rankings
See which AI models perform best at parsing and data extraction, which remain reliable, and where the largest gaps appear. Sorted by: Response time (average) ↓.
Models shown: 15
Average parsing and data extraction score: 8.7
Best model: Qwen3.5-9B 3.6

| Rank | Model | Company | Parsing and data extraction score | Overall score | Tests correct | Response time (average) |
|---|---|---|---|---|---|---|
| #32 | Gemini 3.5 Flash minimal | Google | 10.0 | 7.7 | 2/2 | 1.66s |
| #108 | Qwen3.5-Flash none | Qwen | 10.0 | 5.8 | 2/2 | 1.57s |
| #158 | GLM 4.7 Flash medium | Z.ai | 6.3 | 4.4 | 1/2 | 1.51s |
| #153 | Qwen3.6 35B A3B none | Qwen | 10.0 | 4.6 | 2/2 | 1.46s |
| #61 | Gemini 3.1 Flash Lite low | Google | 10.0 | 7.2 | 2/2 | 1.44s |
| #88 | Qwen3.7 Plus none | Qwen | 10.0 | 6.4 | 2/2 | 1.43s |
| #115 | Qwen3.5-27B none | Qwen | 10.0 | 5.7 | 2/2 | 1.43s |
| #162 | Nemotron 3 Nano Omni 30b A3b Reasoning none | NVIDIA | 3.8 | 4.1 | 0/2 | 1.42s |
| #48 | Gemini 3 Flash Preview none | Google | 10.0 | 7.4 | 2/2 | 1.41s |
| #120 | Mimo V2 PRO none | Xiaomi | 10.0 | 5.6 | 2/2 | 1.39s |
| #159 | Ling-2.6-1T none | Inclusionai | 10.0 | 4.3 | 2/2 | 1.37s |
| #34 | Qwen3.7 Max none | Qwen | 10.0 | 7.7 | 2/2 | 1.35s |
| #124 | Kimi K2.6 none | Moonshot AI | 10.0 | 5.5 | 2/2 | 1.32s |
| #123 | MiMo-V2.5-Pro none | Xiaomi | 10.0 | 5.5 | 2/2 | 1.32s |
| #140 | Qwen3 Coder Next none | Qwen | 6.5 | 4.9 | 1/2 | 1.32s |