AI BENCHY Compare
Elephant vs xAI: Grok 4.20
Benchmark generated from the AI BENCHY test suite on: 2026-04-14
| Metric | Elephant (none) | Grok 4.20 (none) |
|---|---|---|
| Score | 5.2 | 5.2 |
| Rank | #81 | #78 |
| Consistency | 9.6 | 9.5 |
| Correct tests | | |
| Pass rate per attempt | 31.5% | 29.6% |
| Flaky tests | 1 | 1 |
| Total runs | 54 | 54 |
| Cost per result | 0.000 | 1.889 |
| Total cost | $0.000 | $0.095 |
| Input price | $0.000 / 1M tokens | $2.000 / 1M tokens |
| Output price | $0.000 / 1M tokens | $6.000 / 1M tokens |
| Output tokens | 2,573 | 1,967 |
| Reasoning tokens | 0 | 0 |
| Response time (avg) | 1.23s | 1.11s |
| Response time (max) | 3.81s | 6.04s |
| Response time (total) | 22.16s | 20.02s |
[Charts: Score vs. Total Cost; Response Time (avg); Score vs. Response Time (avg); Total Output Tokens; Score vs. Total Output Tokens; Category Breakdown]