AI BENCHY Category
Parsing and data extraction rankings
See which AI models perform best at parsing and data extraction, which remain reliable, and where the biggest gaps appear. Sorted by: Response time (average) ↓.
Models shown
15
Average Parsing and data extraction score
9.0
Top model
Qwen3.5-9B 3.6

| Rank | Model | Company | Parsing and data extraction score | Score | Tests passed | Response time (average) |
|---|---|---|---|---|---|---|
| #1 | Gemini 3 Flash Preview medium | Google | 10.0 | 10.0 | 2/2 | 4.72s |
| #47 | Grok 4.20 medium | X AI | 10.0 | 7.0 | 2/2 | 4.17s |
| #15 | Gemini 2.5 Flash medium | Google | 10.0 | 8.2 | 2/2 | 4.06s |
| #25 | Grok 4.20 Beta medium | X AI | 10.0 | 8.0 | 2/2 | 4.01s |
| #58 | GLM 5V Turbo none | Z.ai | 10.0 | 6.2 | 2/2 | 3.81s |
| #42 | Claude Sonnet 4.6 none | Anthropic | 10.0 | 7.4 | 2/2 | 3.43s |
| #78 | Trinity Large Preview none | Arcee AI | 10.0 | 5.3 | 2/2 | 3.26s |
| #40 | GPT-5.2 medium | OpenAI | 10.0 | 7.5 | 2/2 | 3.15s |
| #7 | GPT-5.3-Codex medium | OpenAI | 10.0 | 8.6 | 2/2 | 3.07s |
| #28 | GPT-5.2 Chat none | OpenAI | 10.0 | 7.9 | 2/2 | 3.05s |
| #22 | Gemini 3.1 Flash Lite Preview low | Google | 10.0 | 8.1 | 2/2 | 3.00s |
| #38 | GPT-5.4 Nano medium | OpenAI | 10.0 | 7.6 | 2/2 | 2.54s |
| #77 | GLM 5 Turbo none | Z.ai | 10.0 | 5.5 | 2/2 | 2.47s |
| #44 | GPT-5.4 Mini medium | OpenAI | 10.0 | 7.3 | 2/2 | 2.43s |
| #3 | Claude Opus 4.7 medium | Anthropic | 10.0 | 9.2 | 2/2 | 2.37s |