Categoría AI BENCHY
Ranking de Trucos anti-IA
Mira qué modelos de IA rinden mejor en Trucos anti-IA, cuáles se mantienen fiables y dónde aparecen las mayores diferencias. Ordenar por: Pruebas correctas ↑.
| Rango | Modelo | Empresa | Puntuación de Trucos anti-IA | Puntuación | Pruebas correctas | Tiempo de respuesta (promedio) |
|---|---|---|---|---|---|---|
| #79 | gpt-oss-120b medium | OpenAI | 6.7 | 5.8 | 2/4 | 10.2s |
| #83 | MiniMax M2.5 medium | Minimax | 7.9 | 5.7 | 2/4 | 20.8s |
| #90 | Ling 2.6 Flash none | Inclusionai | 6.5 | 5.4 | 2/4 | 12.3s |
| #94 | MiniMax M2.7 medium | Minimax | 7.9 | 5.3 | 2/4 | 40.3s |
| #95 | Elephant Alpha medium | Openrouter | 6.6 | 5.2 | 2/4 | 1.19s |
| #98 | gpt-oss-120b none | OpenAI | 6.6 | 5.2 | 2/4 | 6.03s |
| #99 | Elephant Alpha none | Openrouter | 6.6 | 5.2 | 2/4 | 963ms |
| #3 | Claude Opus 4.7 medium | Anthropic | 8.3 | 9.2 | 3/4 | 1.85s |
| #4 | Claude Opus 4.7 none | Anthropic | 8.3 | 9.2 | 3/4 | 2.12s |
| #8 | Seed-2.0-Lite medium | Bytedance Seed | 8.3 | 8.6 | 3/4 | 18.0s |
| #9 | GPT-5.3-Codex medium | OpenAI | 8.7 | 8.6 | 3/4 | 4.16s |
| #10 | Qwen3.5 Plus 2026-02-15 medium | Qwen | 8.2 | 8.5 | 3/4 | 45.8s |
| #13 | Qwen3.5-27B medium | Qwen | 8.7 | 8.4 | 3/4 | 19.8s |
| #18 | Gemini 2.5 Flash medium | 8.4 | 8.2 | 3/4 | 6.30s | |
| #19 | GPT-5.4 medium | OpenAI | 8.3 | 8.2 | 3/4 | 4.11s |