Categoría AI BENCHY
Ranking de Trucos anti-IA
Mira qué modelos de IA rinden mejor en Trucos anti-IA, cuáles se mantienen fiables y dónde aparecen las mayores diferencias. Ordenar por: Pruebas correctas ↓.
Modelos mostrados
15
Promedio de Puntuación de Trucos anti-IA
6.9
Mejor modelo
Gemini 3.5 Flash 10.0
169/169
Filtrar modelos
Ningún modelo coincide con la búsqueda y los filtros actuales.
| Rango | Modelo | Empresa | Puntuación de Trucos anti-IA | Puntuación | Costo total | Pruebas correctas | Tiempo de respuesta (promedio) |
|---|---|---|---|---|---|---|---|
| #160 | Grok Build 0.1 none | X AI | 8.7 | 4.2 | $0.547 | 3/4 | 6.30s |
| #16 | GPT-5 Mini medium | OpenAI | 7.1 | 8.5 | $0.159 | 2/4 | 13.9s |
| #22 | GPT-5.2 medium | OpenAI | 6.5 | 8.4 | $0.548 | 2/4 | 7.81s |
| #31 | Claude Sonnet 4.6 medium | Anthropic | 6.5 | 7.8 | $1.418 | 2/4 | 2.98s |
| #35 | Kimi K2.6 medium | Moonshot AI | 7.0 | 7.8 | $0.889 | 2/4 | 11.6s |
| #38 | Claude Opus 4.6 medium | Anthropic | 6.4 | 7.7 | $2.053 | 2/4 | 7.45s |
| #43 | Kimi K2.5 medium | Moonshot AI | 7.3 | 7.5 | $0.348 | 2/4 | 51.4s |
| #44 | Mercury 2 medium | Inception | 6.9 | 7.5 | $0.058 | 2/4 | 1.12s |
| #45 | GPT-5.3 Chat none | OpenAI | 6.7 | 7.5 | $0.433 | 2/4 | 3.86s |
| #50 | Seed-2.0-Mini medium | Bytedance Seed | 6.6 | 7.4 | $0.044 | 2/4 | 74.7s |
| #56 | GLM 5V Turbo medium | Z.ai | 7.2 | 7.3 | $0.457 | 2/4 | 10.8s |
| #57 | Claude Opus 4.8 none | Anthropic | 6.5 | 7.2 | $0.539 | 2/4 | 3.40s |
| #60 | Qwen3.7 Plus none | Qwen | 6.5 | 7.2 | $0.023 | 2/4 | 1.38s |
| #65 | Kimi K2.7 Code medium | Moonshot AI | 7.3 | 7.0 | $0.583 | 2/4 | 11.6s |
| #68 | Qwen3.7 Max none | Qwen | 6.5 | 6.9 | $0.054 | 2/4 | 1.08s |