Categoría AI BENCHY
Ranking de Trucos anti-IA
Mira qué modelos de IA rinden mejor en Trucos anti-IA, cuáles se mantienen fiables y dónde aparecen las mayores diferencias. Ordenar por: Pruebas correctas ↓.
Modelos mostrados
15
Promedio de Puntuación de Trucos anti-IA
6.9
Mejor modelo
Gemini 3.5 Flash 10.0
169/169
Filtrar modelos
Ningún modelo coincide con la búsqueda y los filtros actuales.
| Rango | Modelo | Empresa | Puntuación de Trucos anti-IA | Puntuación | Costo total | Pruebas correctas | Tiempo de respuesta (promedio) |
|---|---|---|---|---|---|---|---|
| #73 | Mimo V2 Omni medium | Xiaomi | 10.0 | 6.8 | $0.683 | 4/4 | 2.75s |
| #75 | Qwen3.6 35B A3B medium | Qwen | 10.0 | 6.7 | $0.146 | 4/4 | 6.02s |
| #76 | MiMo-V2.5 medium | Xiaomi | 10.0 | 6.7 | $0.063 | 4/4 | 4.14s |
| #77 | Mimo V2 PRO medium | Xiaomi | 10.0 | 6.7 | $0.333 | 4/4 | 2.86s |
| #80 | Step 3.5 Flash medium | Stepfun | 10.0 | 6.6 | $0.070 | 4/4 | 40.6s |
| #88 | Gemma 4 31B medium | 10.0 | 6.3 | $0.033 | 4/4 | 12.9s | |
| #89 | Qwen3.5-35B-A3B medium | Qwen | 10.0 | 6.3 | $0.401 | 4/4 | 21.1s |
| #91 | Gemini 3 PRO Preview medium | 10.0 | 6.2 | $0.385 | 4/4 | 15.0s | |
| #95 | Gemini 3.1 Flash Lite Preview high | 7.5 | 6.1 | $2.310 | 3/3 | 43.9s | |
| #168 | Step 3.5 Flash none | Stepfun | 10.0 | 2.6 | $0.020 | 4/4 | 35.0s |
| #10 | GPT-5.3-Codex medium | OpenAI | 8.7 | 8.9 | $0.740 | 3/4 | 4.16s |
| #13 | Claude Opus 4.7 medium | Anthropic | 8.3 | 8.7 | $0.679 | 3/4 | 1.85s |
| #17 | GPT-5.4 medium | OpenAI | 8.3 | 8.5 | $1.210 | 3/4 | 4.11s |
| #18 | Seed-2.0-Lite medium | Bytedance Seed | 8.3 | 8.5 | $0.175 | 3/4 | 18.0s |
| #19 | GPT-5.2 Chat none | OpenAI | 8.7 | 8.5 | $0.393 | 3/4 | 3.40s |