AI BENCHY Compare
Elephant vs xAI: Grok 4.20
Last updated at: 2026-04-14
| Metric | Elephant Elephant none | Grok 4.20 Grok 4.20 none |
|---|---|---|
| Score | 5.2 | 5.2 |
| Rank | #81 | #78 |
| Consistency | 9.6 | 9.5 |
| Tests Correct | ||
| Attempt pass rate | 31.5% | 29.6% |
| Flaky tests | 1 | 1 |
| Total Runs | 54 | 54 |
| Cost per result | 0.000 | 1.889 |
| Total Cost | $0.000 | $0.095 |
| Input Price | $0.000 / 1M | $2.000 / 1M |
| Output Price | $0.000 / 1M | $6.000 / 1M |
| Output Tokens | 2,573 | 1,967 |
| Reasoning Tokens | 0 | 0 |
| Response Time (avg) | 1.23s | 1.11s |
| Response Time (max) | 3.81s | 6.04s |
| Response Time (total) | 22.16s | 20.02s |
Score vs Total Cost
Response Time (avg)
Score vs Response Time (avg)
Total Output Tokens
Score vs Total Output Tokens
Category Breakdown
Quick Compare
Switch Comparison Pair
ElephantmediumvsGrok 4.20noneMiniMax M2.7mediumvsGrok 4.20noneMiniMax M2.7mediumvsElephantnoneMistral Small 4mediumvsGrok 4.20noneMistral Small 4mediumvsElephantnoneElephantnonevsQwen3 Coder NextmediumMiniMax M2.5mediumFree AvailablevsGrok 4.20noneQwen3 Coder NextmediumvsGrok 4.20noneMiniMax M2.5mediumFree AvailablevsElephantnoneElephantnonevsGLM 4.7 FlashmediumGrok 4.20nonevsGLM 4.7 Flashmediumgpt-oss-120bmediumFree AvailablevsGrok 4.20none