Categoría AI BENCHY
Ranking de Trucos anti-IA
Mira qué modelos de IA rinden mejor en Trucos anti-IA, cuáles se mantienen fiables y dónde aparecen las mayores diferencias. Ordenar por: Pruebas correctas ↑.
| Rango | Modelo | Empresa | Puntuación de Trucos anti-IA | Puntuación | Pruebas correctas | Tiempo de respuesta (promedio) |
|---|---|---|---|---|---|---|
| #2 | Gemini 3.1 Pro Preview medium | 10.0 | 9.6 | 4/4 | 7.90s | |
| #5 | GPT-5.5 low | OpenAI | 10.0 | 9.0 | 4/4 | 4.15s |
| #6 | GPT-5.5 medium | OpenAI | 10.0 | 9.0 | 4/4 | 4.66s |
| #7 | Gemini 3 Flash Preview low | 10.0 | 8.8 | 4/4 | 3.48s | |
| #11 | HY3 Preview high | Tencent | 10.0 | 8.5 | 4/4 | 32.7s |
| #12 | Qwen3.6 Plus Preview medium | Qwen | 10.0 | 8.5 | 4/4 | 9.90s |
| #14 | Gemini 3.1 Flash Lite Preview high | 10.0 | 8.4 | 3/3 | 43.9s | |
| #15 | Gemini 3 PRO Preview medium | 10.0 | 8.4 | 4/4 | 15.0s | |
| #16 | GLM 5 medium | Z.ai | 10.0 | 8.4 | 4/4 | 23.7s |
| #17 | Gemma 4 31B medium | 10.0 | 8.3 | 4/4 | 12.9s | |
| #21 | GLM 5 Turbo medium | Z.ai | 10.0 | 8.1 | 4/4 | 4.82s |
| #22 | Qwen3.5-122B-A10B medium | Qwen | 10.0 | 8.1 | 4/4 | 9.75s |
| #23 | Qwen3.6 Plus medium | Qwen | 10.0 | 8.1 | 4/4 | 9.90s |
| #24 | HY3 Preview low | Tencent | 10.0 | 8.1 | 4/4 | 16.6s |
| #27 | MiMo-V2.5-Pro medium | Xiaomi | 10.0 | 8.1 | 4/4 | 2.95s |