Kategori AI BENCHY
Peringkat Parsing dan ekstraksi data
Lihat model AI mana yang paling baik di Parsing dan ekstraksi data, mana yang tetap andal, dan di mana kesenjangan terbesar muncul. Urutkan berdasarkan: Waktu respons (rata-rata) ↓.
Model yang ditampilkan
15
Rata-rata Skor Parsing dan ekstraksi data
8.7
Model terbaik
Qwen3.5-9B 3.6| Peringkat | Model | Perusahaan | Skor Parsing dan ekstraksi data | Skor | Tes benar | Waktu respons (rata-rata) |
|---|---|---|---|---|---|---|
| #105 | Nemotron 3 Super medium | NVIDIA | 10.0 | 5.8 | 2/2 | 18.2s |
| #119 | Cobuddy medium | Baidu | 6.3 | 5.6 | 1/2 | 17.4s |
| #51 | Mimo V2 PRO medium | Xiaomi | 7.3 | 7.4 | 1/2 | 17.2s |
| #37 | Gemma 4 26B A4B medium | 10.0 | 7.6 | 2/2 | 16.5s | |
| #62 | Step 3.5 Flash medium | Stepfun | 10.0 | 7.2 | 2/2 | 15.0s |
| #26 | Qwen3.6 Plus medium | Qwen | 10.0 | 7.9 | 2/2 | 14.9s |
| #93 | Qwen3.6 Plus Preview medium | Qwen | 10.0 | 6.3 | 2/2 | 14.9s |
| #67 | MiniMax M3 medium | Minimax | 10.0 | 7.1 | 2/2 | 14.9s |
| #71 | Step 3.7 Flash high | Stepfun | 10.0 | 7.0 | 2/2 | 14.7s |
| #52 | Claude Sonnet 4.6 medium | Anthropic | 10.0 | 7.4 | 2/2 | 13.9s |
| #46 | Qwen3.6 35B A3B medium | Qwen | 10.0 | 7.4 | 2/2 | 13.0s |
| #54 | GPT-5 Mini medium | OpenAI | 10.0 | 7.3 | 2/2 | 12.6s |
| #10 | Claude Opus 4.8 medium | Anthropic | 7.1 | 8.7 | 1/2 | 12.3s |
| #82 | Hy3 preview high | Tencent | 6.5 | 6.6 | 1/2 | 12.1s |
| #35 | Gemini 3 PRO Preview medium | 10.0 | 7.6 | 2/2 | 10.8s |