AI BENCHY Compare
MoonshotAI: Kimi K2.5 vs xAI: Grok 4.20
Benchmark generated from the AI BENCHY test suite on: 2026-05-22
| Metric | Kimi K2.5 (none) | Grok 4.20 (none) |
|---|---|---|
| Score | 5.3 | 5.4 |
| Rank | #126 | #120 |
| Reliability | 10.0 | N/A |
| Consistency | 8.9 | 9.5 |
| Correct tests | | |
| Pass rate per attempt | 36.7% | 35.2% |
| Flaky tests | 3 | 1 |
| Total runs | 60 | 54 |
| Cost per result | 0.428 | 1.574 |
| Total cost | $0.026 | $0.095 |
| Input price | $0.400 / 1M | $1.250 / 1M |
| Output price | $1.900 / 1M | $2.500 / 1M |
| Output tokens | 6,734 | 1,967 |
| Reasoning tokens | 0 | 0 |
| Response time (avg) | 14.16s | 1.11s |
| Response time (max) | 42.13s | 6.04s |
| Response time (total) | 184.10s | 20.02s |
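
The table's raw figures can be turned into relative ratios for a quicker read. The following is a minimal sketch, not part of AI BENCHY: the dictionary keys and variable names are hypothetical, and the numbers are copied from the total cost, average response time, and output token rows of the table.

```python
# Figures copied from the comparison table; key names are illustrative only.
kimi = {"total_cost_usd": 0.026, "avg_response_s": 14.16, "output_tokens": 6734}
grok = {"total_cost_usd": 0.095, "avg_response_s": 1.11, "output_tokens": 1967}

# How many times more Grok 4.20 cost in total than Kimi K2.5.
cost_ratio = grok["total_cost_usd"] / kimi["total_cost_usd"]

# How many times longer Kimi K2.5 took per response on average.
latency_ratio = kimi["avg_response_s"] / grok["avg_response_s"]

# How many times more output tokens Kimi K2.5 produced.
token_ratio = kimi["output_tokens"] / grok["output_tokens"]

print(f"Grok 4.20 total cost is {cost_ratio:.2f}x that of Kimi K2.5")
print(f"Kimi K2.5 average response time is {latency_ratio:.2f}x that of Grok 4.20")
print(f"Kimi K2.5 output tokens are {token_ratio:.2f}x those of Grok 4.20")
```

These ratios only restate the table. They do not account for the different run counts (60 vs 54), so they should be read as rough comparisons.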
Charts: Score vs. total cost; Response time (avg); Score vs. response time (avg); Total output tokens; Score vs. total output tokens