Categorie AI BENCHY
Clasament Trucuri anti-AI
Vezi ce modele AI se descurcă cel mai bine la Trucuri anti-AI, care rămân fiabile și unde apar cele mai mari diferențe. Sortează după: Timp de răspuns (mediu) ↑.
| Rang | Model | Companie | Scor Trucuri anti-AI | Scor | Teste corecte | Timp de răspuns (mediu) |
|---|---|---|---|---|---|---|
| #84 | gpt-oss-120b none | OpenAI | 6.6 | 5.2 | 2/4 | 6.03s |
| #24 | Gemma 4 26B A4B medium | 10.0 | 8.0 | 4/4 | 6.20s | |
| #76 | Kimi K2.5 none | Moonshot AI | 3.6 | 5.5 | 0/4 | 6.24s |
| #15 | Gemini 2.5 Flash medium | 8.4 | 8.2 | 3/4 | 6.30s | |
| #88 | Nemotron 3 Super none | NVIDIA | 4.8 | 5.1 | 1/4 | 7.43s |
| #37 | Claude Opus 4.6 medium | Anthropic | 6.4 | 7.6 | 2/4 | 7.45s |
| #64 | DeepSeek V3.2 none | DeepSeek | 3.2 | 6.1 | 0/4 | 7.63s |
| #40 | GPT-5.2 medium | OpenAI | 6.5 | 7.5 | 2/4 | 7.81s |
| #2 | Gemini 3.1 Pro Preview medium | 10.0 | 9.6 | 4/4 | 7.90s | |
| #33 | GLM 5.1 medium | Z.ai | 10.0 | 7.8 | 4/4 | 8.31s |
| #92 | Qwen3 Coder Next medium | Qwen | 3.5 | 4.7 | 0/4 | 8.64s |
| #19 | Qwen3.5-122B-A10B medium | Qwen | 10.0 | 8.1 | 4/4 | 9.75s |
| #9 | Qwen3.6 Plus Preview medium | Qwen | 10.0 | 8.5 | 4/4 | 9.90s |
| #20 | Qwen3.6 Plus medium | Qwen | 10.0 | 8.1 | 4/4 | 9.90s |
| #51 | Nemotron 3 Super medium | NVIDIA | 10.0 | 6.7 | 4/4 | 10.1s |