AI BENCHY Category
Anti-AI Tricks Ranking
See which AI models perform best on Anti-AI Tricks, which ones stay reliable, and where the largest gaps appear. Sorted by: Correct tests ↓.
Models shown
15
Average Anti-AI Tricks Score
6.9
Best model
Gemini 3.5 Flash 10.0
169/169
| Rank | Model | Company | Anti-AI Tricks Score | Score | Total cost | Correct tests | Response time (average) |
|---|---|---|---|---|---|---|---|
| #152 | Elephant Alpha none | Openrouter | 6.6 | 4.6 | $0.000 | 2/4 | 963ms |
| #153 | Elephant Alpha medium | Openrouter | 6.6 | 4.5 | $0.000 | 2/4 | 1.19s |
| #156 | Laguna Xs.2 medium | Poolside | 6.9 | 4.3 | $0.000 | 2/4 | 2.68s |
| #164 | gpt-oss-120b none | OpenAI | 6.5 | 4.0 | $0.010 | 2/4 | 32.8s |
| #166 | Nemotron 3 Nano Omni 30b A3b Reasoning medium | NVIDIA | 6.4 | 3.6 | $0.000 | 2/4 | 1.20s |
| #40 | MiniMax M3 medium | Minimax | 5.5 | 7.6 | $0.131 | 1/4 | 14.9s |
| #41 | DeepSeek V4 Pro high | DeepSeek | 5.7 | 7.6 | $0.157 | 1/4 | 25.7s |
| #55 | Claude Sonnet 4.6 none | Anthropic | 4.8 | 7.3 | $0.316 | 1/4 | 2.94s |
| #100 | Qwen3.6 Max Preview none | Qwen | 5.2 | 6.0 | $0.075 | 1/4 | 2.63s |
| #101 | GLM 5 none | Z.ai | 4.8 | 6.0 | $0.027 | 1/4 | 2.37s |
| #104 | Qwen3.5-27B none | Qwen | 4.8 | 5.9 | $0.015 | 1/4 | 788ms |
| #105 | GLM 5V Turbo none | Z.ai | 4.8 | 5.9 | $0.052 | 1/4 | 3.13s |
| #106 | Qwen3.5 Plus 2026-02-15 none | Qwen | 4.8 | 5.8 | $0.016 | 1/4 | 1.91s |
| #108 | Owl Alpha medium | Openrouter | 4.8 | 5.8 | $0.000 | 1/4 | 3.97s |
| #111 | Kimi K2.6 none | Moonshot AI | 4.6 | 5.8 | $0.079 | 1/4 | 1.39s |