Categorie AI BENCHY
Clasament Specific domeniului
Vezi ce modele AI se descurcă cel mai bine la Specific domeniului, care rămân fiabile și unde apar cele mai mari diferențe. Sortează după: Teste corecte ↓.
Modele afișate
15
Media pentru Scor Specific domeniului
4.8
Cel mai bun model
Gemini 3 Flash Preview 10.0| Rang | Model | Companie | Scor Specific domeniului | Scor | Teste corecte | Timp de răspuns (mediu) |
|---|---|---|---|---|---|---|
| #89 | Hy3 preview low | Tencent | 5.9 | 6.4 | 1/3 | 40.4s |
| #92 | Laguna M.1 medium | Poolside | 5.3 | 6.4 | 1/3 | 24.1s |
| #94 | GPT-5 Nano medium | OpenAI | 5.2 | 6.3 | 1/3 | 204.0s |
| #95 | Qwen3.5 Plus 2026-02-15 none | Qwen | 5.3 | 6.3 | 1/3 | 1.17s |
| #96 | Ring-2.6-1T none | Inclusionai | 5.3 | 6.2 | 1/3 | 73.4s |
| #97 | Gemini 2.5 Flash none | 5.9 | 6.2 | 1/3 | 495ms | |
| #101 | Mimo V2 Omni none | Xiaomi | 5.3 | 6.0 | 1/3 | 2.10s |
| #104 | Nemotron 3 Ultra 550b A55b none | NVIDIA | 5.3 | 6.0 | 1/3 | 698ms |
| #109 | GLM 5V Turbo none | Z.ai | 5.3 | 5.8 | 1/3 | 2.09s |
| #111 | Owl Alpha medium | Openrouter | 5.3 | 5.7 | 1/3 | 8.58s |
| #113 | DeepSeek V4 Pro none | DeepSeek | 5.3 | 5.7 | 1/3 | 3.17s |
| #114 | Qwen3.5 Plus 2026-04-20 none | Qwen | 5.3 | 5.7 | 1/3 | 4.43s |
| #116 | Hunter Alpha none | OpenRouter | 5.3 | 5.7 | 1/3 | 2.33s |
| #120 | Mimo V2 PRO none | Xiaomi | 5.3 | 5.6 | 1/3 | 1.78s |
| #121 | Owl Alpha none | Openrouter | 5.3 | 5.5 | 1/3 | 3.00s |