AI BENCHY Category
Parsing and data extraction rankings
See which AI models perform best at parsing and data extraction, which remain reliable, and where the biggest gaps appear. Sorted by: Tests correct ↓.
Models shown: 15
Average parsing and data extraction score: 9.0
Top model: Gemini 3 Flash Preview (10.0)

| Rank | Model | Company | Parsing and data extraction score | Score | Tests correct | Response time (average) |
|---|---|---|---|---|---|---|
| #32 | Qwen3.5-Flash medium | Qwen | 7.3 | 7.8 | 1/2 | 57.0s |
| #41 | MiMo-V2-Flash medium | Xiaomi | 6.5 | 7.5 | 1/2 | 0ms |
| #43 | Qwen3.5-35B-A3B medium | Qwen | 7.3 | 7.4 | 1/2 | 59.3s |
| #54 | Mercury 2 medium | Inception | 7.3 | 6.5 | 1/2 | 1.11s |
| #64 | DeepSeek V3.2 none | DeepSeek | 6.3 | 6.1 | 1/2 | 9.42s |
| #68 | gpt-oss-120b medium | OpenAI | 6.4 | 5.8 | 1/2 | 1.98s |
| #73 | Mistral Small 4 medium | Mistral | 7.3 | 5.7 | 1/2 | 1.23s |
| #74 | GLM 4.7 Flash none | Z.ai | 7.3 | 5.6 | 1/2 | 4.82s |
| #76 | Kimi K2.5 none | Moonshot AI | 7.3 | 5.5 | 1/2 | 42.1s |
| #80 | MiniMax M2.7 medium | Minimax | 6.3 | 5.3 | 1/2 | 21.9s |
| #81 | Elephant medium | Openrouter | 6.5 | 5.2 | 1/2 | 979ms |
| #84 | gpt-oss-120b none | OpenAI | 6.5 | 5.2 | 1/2 | 7.12s |
| #85 | Elephant none | Openrouter | 6.5 | 5.2 | 1/2 | 1.04s |
| #87 | Qwen3 Coder Next none | Qwen | 6.5 | 5.1 | 1/2 | 1.32s |
| #91 | Mercury 2 none | Inception | 7.3 | 4.8 | 1/2 | 667ms |