Kategori AI BENCHY
Peringkat Parsing dan ekstraksi data
Lihat model AI mana yang paling baik di Parsing dan ekstraksi data, mana yang tetap andal, dan di mana kesenjangan terbesar muncul. Urutkan berdasarkan: Waktu respons (rata-rata) ↓.
Model yang ditampilkan
15
Rata-rata Skor Parsing dan ekstraksi data
8.7
Model terbaik
Qwen3.5-9B 3.6| Peringkat | Model | Perusahaan | Skor Parsing dan ekstraksi data | Skor | Tes benar | Waktu respons (rata-rata) |
|---|---|---|---|---|---|---|
| #31 | DeepSeek V4 Flash high | DeepSeek | 10.0 | 7.7 | 2/2 | 28.0s |
| #73 | Seed-2.0-Mini medium | Bytedance Seed | 10.0 | 6.9 | 2/2 | 24.3s |
| #139 | DeepSeek V4 Flash none | DeepSeek | 10.0 | 5.0 | 2/2 | 23.8s |
| #103 | DeepSeek V4 Pro high | DeepSeek | 7.3 | 6.0 | 1/2 | 23.6s |
| #29 | Qwen3.5-122B-A10B medium | Qwen | 10.0 | 7.8 | 2/2 | 23.4s |
| #79 | Hunter Alpha medium | OpenRouter | 10.0 | 6.7 | 2/2 | 23.2s |
| #130 | MiniMax M2.7 medium | Minimax | 6.3 | 5.3 | 1/2 | 21.9s |
| #18 | Qwen3.7 Plus medium | Qwen | 10.0 | 8.2 | 2/2 | 21.7s |
| #111 | Owl Alpha medium | Openrouter | 10.0 | 5.7 | 2/2 | 21.6s |
| #94 | GPT-5 Nano medium | OpenAI | 3.7 | 6.3 | 0/2 | 21.4s |
| #27 | Gemma 4 31B medium | 10.0 | 7.8 | 2/2 | 21.1s | |
| #60 | Kimi K2.6 medium | Moonshot AI | 10.0 | 7.2 | 2/2 | 20.4s |
| #152 | MiMo-V2-Flash none | Xiaomi | 2.9 | 4.6 | 0/2 | 19.7s |
| #38 | Grok 4.3 medium | X AI | 10.0 | 7.6 | 2/2 | 19.0s |
| #43 | MiMo-V2.5-Pro medium | Xiaomi | 7.3 | 7.5 | 1/2 | 18.8s |