AI BENCHY Compare
OpenAI: GPT-5.5 vs xAI: Grok 4.3
Benchmark dihasilkan dari suite pengujian AI BENCHY pada: 2026-05-01
| Metrik | GPT-5.5 GPT-5.5 low | Grok 4.3 Grok 4.3 medium |
|---|---|---|
| Skor | 9.0 | 8.2 |
| Peringkat | #5 | #20 |
| Keandalan | 10.0 | 10.0 |
| Konsistensi | 9.6 | 8.6 |
| Tes benar | ||
| Tingkat lulus per percobaan | 87.0% | 81.5% |
| Tes tidak stabil | 1 | 3 |
| Total Run | 54 | 54 |
| Biaya per hasil | 4.534 | 3.974 |
| Total Biaya | $0.681 | $0.517 |
| Harga input | $5.000 / 1M | $1.250 / 1M |
| Harga output | $30.000 / 1M | $2.500 / 1M |
| Token output | 1,959 | 1,223 |
| Token penalaran | 16,134 | 187,047 |
| Waktu respons (rata-rata) | 8.39s | 48.63s |
| Waktu respons (maks) | 56.19s | 216.69s |
| Waktu respons (total) | 151.01s | 875.27s |
Skor vs Total Biaya
Waktu respons (rata-rata)
Skor vs Waktu respons (rata-rata)
Total token output
Skor vs Total token output
Rincian Kategori
Perbandingan Cepat
Ganti Pasangan Perbandingan
HY3 PreviewlowTersedia gratisvsGrok 4.3mediumGemini 3 Flash PreviewnonevsGrok 4.3mediumGemini 3.1 Flash Lite PreviewlowvsGrok 4.3mediumClaude Opus 4.7nonevsGPT-5.5lowClaude Opus 4.7mediumvsGPT-5.5lowGPT-5.5lowvsQwen3.6 Max PreviewmediumGPT-5.5lowvsQwen3.6 35B A3BmediumGPT-5.2 ChatnonevsGrok 4.3mediumGemini 3.1 Flash Lite PreviewnonevsGrok 4.3mediumGPT-5.3 ChatnonevsGrok 4.3mediumGPT-5.5lowvsHY3 PreviewhighTersedia gratisHY3 PreviewhighTersedia gratisvsGrok 4.3medium