AI BENCHY Category
General Intelligence Ranking
See which AI models perform best on General Intelligence, which ones stay reliable, and where the gaps are widest. Sorted by: Correct tests ↑.
| Rank | Model | Company | General Intelligence score | Score | Correct tests | Response time (avg.) |
|---|---|---|---|---|---|---|
| #14 | Qwen3.6 Max Preview medium | Qwen | 10.0 | 8.5 | 1/1 | 32.2s |
| #16 | Gemini 3 Flash Preview low | | 10.0 | 8.4 | 1/1 | 3.68s |
| #18 | Qwen3.7 Plus medium | Qwen | 10.0 | 8.2 | 1/1 | 25.5s |
| #20 | Gemini 3.5 Flash none | | 10.0 | 8.1 | 1/1 | 3.46s |
| #27 | Gemma 4 31B medium | | 10.0 | 7.8 | 1/1 | 9.57s |
| #32 | Gemini 3.5 Flash minimal | | 10.0 | 7.7 | 1/1 | 922ms |
| #33 | Hy3 preview medium | Tencent | 10.0 | 7.7 | 1/1 | 16.8s |
| #34 | Qwen3.7 Max none | Qwen | 10.0 | 7.7 | 1/1 | 1.04s |
| #35 | Gemini 3 PRO Preview medium | | 10.0 | 7.6 | 1/1 | 9.34s |
| #37 | Gemma 4 26B A4B medium | | 10.0 | 7.6 | 1/1 | 29.8s |
| #40 | Gemini 3.1 Flash Lite Preview medium | | 10.0 | 7.5 | 1/1 | 3.16s |
| #44 | Gemini 3.1 Flash Lite medium | | 10.0 | 7.5 | 1/1 | 2.60s |
| #48 | Gemini 3 Flash Preview none | | 10.0 | 7.4 | 1/1 | 1.13s |
| #51 | Mimo V2 PRO medium | Xiaomi | 10.0 | 7.4 | 1/1 | 4.92s |
| #52 | Claude Sonnet 4.6 medium | Anthropic | 10.0 | 7.4 | 1/1 | 4.94s |