AI BENCHY Category
General Intelligence Rankings
See which AI models perform best at General intelligence, which remain reliable, and where the biggest gaps appear.
Models shown: 15
Average General intelligence score: 5.9
Best model: Gemini 3 Flash Preview (10.0)

| Rank | Model | Company | General intelligence score | Score | Tests passed | Response time (avg) |
|---|---|---|---|---|---|---|
| #18 | Qwen3.7 Plus medium | Qwen | 10.0 | 8.2 | 1/1 | 25.5s |
| #20 | Gemini 3.5 Flash none | | 10.0 | 8.1 | 1/1 | 3.46s |
| #27 | Gemma 4 31B medium | | 10.0 | 7.8 | 1/1 | 9.57s |
| #32 | Gemini 3.5 Flash minimal | | 10.0 | 7.7 | 1/1 | 922ms |
| #33 | Hy3 preview medium | Tencent | 10.0 | 7.7 | 1/1 | 16.8s |
| #34 | Qwen3.7 Max none | Qwen | 10.0 | 7.7 | 1/1 | 1.04s |
| #35 | Gemini 3 PRO Preview medium | | 10.0 | 7.6 | 1/1 | 9.34s |
| #37 | Gemma 4 26B A4B medium | | 10.0 | 7.6 | 1/1 | 29.8s |
| #40 | Gemini 3.1 Flash Lite Preview medium | | 10.0 | 7.5 | 1/1 | 3.16s |
| #44 | Gemini 3.1 Flash Lite medium | | 10.0 | 7.5 | 1/1 | 2.60s |
| #48 | Gemini 3 Flash Preview none | | 10.0 | 7.4 | 1/1 | 1.13s |
| #51 | Mimo V2 PRO medium | Xiaomi | 10.0 | 7.4 | 1/1 | 4.92s |
| #52 | Claude Sonnet 4.6 medium | Anthropic | 10.0 | 7.4 | 1/1 | 4.94s |
| #55 | GLM 5.1 medium | Z.ai | 10.0 | 7.3 | 1/1 | 20.9s |
| #59 | GLM 5V Turbo medium | Z.ai | 10.0 | 7.2 | 1/1 | 11.1s |