AI BENCHY Compare
Inception: Mercury 2 vs xAI: Grok 4.20
Last updated at: 2026-04-16
| Metric | Mercury 2 Mercury 2 none | Grok 4.20 Grok 4.20 none |
|---|---|---|
| Score | 4.8 | 5.2 |
| Rank | #89 | #80 |
| Consistency | 9.0 | 9.5 |
| Tests Correct | ||
| Attempt pass rate | 27.8% | 29.6% |
| Flaky tests | 2 | 1 |
| Total Runs | 54 | 54 |
| Cost per result | 0.165 | 1.889 |
| Total Cost | $0.007 | $0.095 |
| Input Price | $0.250 / 1M | $2.000 / 1M |
| Output Price | $0.750 / 1M | $6.000 / 1M |
| Output Tokens | 1,625 | 1,967 |
| Reasoning Tokens | 0 | 0 |
| Response Time (avg) | 613ms | 1.11s |
| Response Time (max) | 1.27s | 6.04s |
| Response Time (total) | 11.04s | 20.02s |
Score vs Total Cost
Response Time (avg)
Score vs Response Time (avg)
Total Output Tokens
Score vs Total Output Tokens
Category Breakdown
Quick Compare
Switch Comparison Pair
ElephantmediumvsGrok 4.20noneMiniMax M2.7mediumvsGrok 4.20noneMercury 2nonevsQwen3 Coder NextmediumMercury 2nonevsGLM 4.7 FlashmediumMercury 2nonevsQwen3.5-9BmediumMistral Small 4mediumvsGrok 4.20noneMercury 2nonevsElephantmediumMiniMax M2.5mediumFree AvailablevsGrok 4.20noneMercury 2nonevsMiniMax M2.7mediumQwen3 Coder NextmediumvsGrok 4.20noneGrok 4.20nonevsGLM 4.7 Flashmediumgpt-oss-120bmediumFree AvailablevsGrok 4.20none