AI BENCHY Compare
Inception: Mercury 2 vs xAI: Grok 4.20
Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-04-16
| Kipimo | Mercury 2 Mercury 2 none | Grok 4.20 Grok 4.20 none |
|---|---|---|
| Alama | 4.8 | 5.2 |
| Nafasi | #89 | #80 |
| Uthabiti | 9.0 | 9.5 |
| Majaribio sahihi | ||
| Kiwango cha kupita kwa kila jaribio | 27.8% | 29.6% |
| Majaribio yasiyo thabiti | 2 | 1 |
| Jumla ya uendeshaji | 54 | 54 |
| Gharama kwa matokeo | 0.165 | 1.889 |
| Jumla ya gharama | $0.007 | $0.095 |
| Bei ya ingizo | $0.250 / 1M | $2.000 / 1M |
| Bei ya toleo | $0.750 / 1M | $6.000 / 1M |
| Tokeni za matokeo | 1,625 | 1,967 |
| Tokeni za hoja | 0 | 0 |
| Muda wa majibu (wastani) | 613ms | 1.11s |
| Muda wa majibu (upeo) | 1.27s | 6.04s |
| Muda wa majibu (jumla) | 11.04s | 20.02s |
Alama dhidi ya gharama ya jumla
Muda wa majibu (wastani)
Alama vs Muda wa majibu (wastani)
Jumla ya tokeni za matokeo
Alama vs Jumla ya tokeni za matokeo
Mgawanyo wa kategoria
Ulinganisho wa haraka
Badilisha jozi ya ulinganisho
ElephantmediumvsGrok 4.20noneMiniMax M2.7mediumvsGrok 4.20noneMercury 2nonevsQwen3 Coder NextmediumMercury 2nonevsGLM 4.7 FlashmediumMercury 2nonevsQwen3.5-9BmediumMistral Small 4mediumvsGrok 4.20noneMercury 2nonevsElephantmediumMiniMax M2.5mediumInapatikana burevsGrok 4.20noneMercury 2nonevsMiniMax M2.7mediumQwen3 Coder NextmediumvsGrok 4.20noneGrok 4.20nonevsGLM 4.7 Flashmediumgpt-oss-120bmediumInapatikana burevsGrok 4.20none