AI BENCHY Compare
MoonshotAI: Kimi K2.6 vs OpenAI: GPT-5.5
Benchmark dihasilkan dari suite pengujian AI BENCHY pada: 2026-04-24
| Metrik | Kimi K2.6 Kimi K2.6 medium | GPT-5.5 GPT-5.5 none |
|---|---|---|
| Skor | 7.7 | 6.8 |
| Peringkat | #42 | #58 |
| Keandalan | T/A | T/A |
| Konsistensi | 8.3 | 8.3 |
| Tes benar | ||
| Tingkat lulus per percobaan | 74.1% | 61.1% |
| Tes tidak stabil | 4 | 4 |
| Total Run | 54 | 54 |
| Biaya per hasil | 6.563 | 2.162 |
| Total Biaya | $0.722 | $0.195 |
| Harga input | $0.745 / 1M | $5.000 / 1M |
| Harga output | $4.655 / 1M | $30.000 / 1M |
| Token output | 80,759 | 1,910 |
| Token penalaran | 179,814 | 0 |
| Waktu respons (rata-rata) | 45.20s | 1.83s |
| Waktu respons (maks) | 215.85s | 5.56s |
| Waktu respons (total) | 768.37s | 32.86s |
Skor vs Total Biaya
Waktu respons (rata-rata)
Skor vs Waktu respons (rata-rata)
Total token output
Skor vs Total token output
Rincian Kategori
Perbandingan Cepat
Ganti Pasangan Perbandingan
Nemotron 3 SupermediumTersedia gratisvsGPT-5.5noneKimi K2.6mediumvsGPT-5.3 ChatnoneGPT-5.5nonevsGrok 4.1 FastmediumDeepSeek V4 FlashhighvsKimi K2.6mediumGemini 3.1 Flash Lite PreviewnonevsKimi K2.6mediumKimi K2.6mediumvsGPT-5.2 ChatnoneGPT-5.5nonevsGrok 4.20mediumKimi K2.5mediumvsGPT-5.5noneMercury 2mediumvsGPT-5.5noneClaude Sonnet 4.6nonevsKimi K2.6mediumGemini 3.1 Flash Lite PreviewlowvsKimi K2.6mediumGemini 3 Flash PreviewnonevsKimi K2.6medium