Categoría AI BENCHY
Ranking de Inteligencia general
Mira qué modelos de IA rinden mejor en Inteligencia general, cuáles se mantienen fiables y dónde aparecen las mayores diferencias. Ordenar por: Pruebas correctas ↓.
Modelos mostrados
15
Promedio de Puntuación de Inteligencia general
5.9
Mejor modelo
Gemini 3 Flash Preview 10.0| Rango | Modelo | Empresa | Puntuación de Inteligencia general | Puntuación | Pruebas correctas | Tiempo de respuesta (promedio) |
|---|---|---|---|---|---|---|
| #18 | Qwen3.7 Plus medium | Qwen | 10.0 | 8.2 | 1/1 | 25.5s |
| #20 | Gemini 3.5 Flash none | 10.0 | 8.1 | 1/1 | 3.46s | |
| #27 | Gemma 4 31B medium | 10.0 | 7.8 | 1/1 | 9.57s | |
| #32 | Gemini 3.5 Flash minimal | 10.0 | 7.7 | 1/1 | 922ms | |
| #33 | Hy3 preview medium | Tencent | 10.0 | 7.7 | 1/1 | 16.8s |
| #34 | Qwen3.7 Max none | Qwen | 10.0 | 7.7 | 1/1 | 1.04s |
| #35 | Gemini 3 PRO Preview medium | 10.0 | 7.6 | 1/1 | 9.34s | |
| #37 | Gemma 4 26B A4B medium | 10.0 | 7.6 | 1/1 | 29.8s | |
| #40 | Gemini 3.1 Flash Lite Preview medium | 10.0 | 7.5 | 1/1 | 3.16s | |
| #44 | Gemini 3.1 Flash Lite medium | 10.0 | 7.5 | 1/1 | 2.60s | |
| #48 | Gemini 3 Flash Preview none | 10.0 | 7.4 | 1/1 | 1.13s | |
| #51 | Mimo V2 PRO medium | Xiaomi | 10.0 | 7.4 | 1/1 | 4.92s |
| #52 | Claude Sonnet 4.6 medium | Anthropic | 10.0 | 7.4 | 1/1 | 4.94s |
| #55 | GLM 5.1 medium | Z.ai | 10.0 | 7.3 | 1/1 | 20.9s |
| #59 | GLM 5V Turbo medium | Z.ai | 10.0 | 7.2 | 1/1 | 11.1s |