Kategori AI BENCHY
Peringkat Parsing dan ekstraksi data
Lihat model AI mana yang paling baik di Parsing dan ekstraksi data, mana yang tetap andal, dan di mana kesenjangan terbesar muncul. Urutkan berdasarkan: Waktu respons (rata-rata) ↑.
Model yang ditampilkan
15
Rata-rata Skor Parsing dan ekstraksi data
9.0
Model terbaik
MiMo-V2-Flash 6.5| Peringkat | Model | Perusahaan | Skor Parsing dan ekstraksi data | Skor | Tes benar | Waktu respons (rata-rata) |
|---|---|---|---|---|---|---|
| #96 | GPT-5.4 Nano none | OpenAI | 6.5 | 4.5 | 1/2 | 1.11s |
| #63 | Qwen3.5-35B-A3B none | Qwen | 10.0 | 6.1 | 2/2 | 1.16s |
| #29 | Gemini 3.1 Flash Lite Preview none | 10.0 | 7.9 | 2/2 | 1.22s | |
| #73 | Mistral Small 4 medium | Mistral | 7.3 | 5.7 | 1/2 | 1.23s |
| #89 | GPT-4o-mini none | OpenAI | 10.0 | 4.9 | 2/2 | 1.27s |
| #86 | GPT-5.4 Mini none | OpenAI | 10.0 | 5.1 | 2/2 | 1.30s |
| #87 | Qwen3 Coder Next none | Qwen | 6.5 | 5.1 | 1/2 | 1.32s |
| #69 | Kimi K2.6 none | Moonshot AI | 10.0 | 5.8 | 2/2 | 1.32s |
| #65 | MiMo-V2-Pro none | Xiaomi | 10.0 | 6.0 | 2/2 | 1.39s |
| #21 | Gemini 3 Flash Preview none | 10.0 | 8.1 | 2/2 | 1.41s | |
| #67 | Qwen3.5-27B none | Qwen | 10.0 | 5.9 | 2/2 | 1.43s |
| #93 | GLM 4.7 Flash medium | Z.ai | 6.3 | 4.6 | 1/2 | 1.51s |
| #59 | Qwen3.5-Flash none | Qwen | 10.0 | 6.2 | 2/2 | 1.57s |
| #55 | MiMo-V2-Omni none | Xiaomi | 10.0 | 6.5 | 2/2 | 1.69s |
| #60 | Gemma 4 26B A4B none | 10.0 | 6.2 | 2/2 | 1.70s |