AI BENCHY Compare
Inception: Mercury 2 vs xAI: Grok 4.20
Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-04-02
| Kipimo | Mercury 2 Mercury 2 medium | Grok 4.20 Grok 4.20 none |
|---|---|---|
| Alama | 6.3 | 5.4 |
| Nafasi | #51 | #69 |
| Uthabiti | 8.5 | 9.5 |
| Majaribio sahihi | ||
| Kiwango cha kupita kwa kila jaribio | 51.0% | 31.4% |
| Majaribio yasiyo thabiti | 3 | 1 |
| Jumla ya uendeshaji | 51 | 51 |
| Gharama kwa matokeo | 0.634 | 1.809 |
| Jumla ya gharama | $0.045 | $0.091 |
| Bei ya ingizo | $0.250 / 1M | $2.000 / 1M |
| Bei ya toleo | $0.750 / 1M | $6.000 / 1M |
| Tokeni za matokeo | 3,723 | 1,655 |
| Tokeni za hoja | 46,120 | 0 |
| Muda wa majibu (wastani) | 2.25s | 1.11s |
| Muda wa majibu (upeo) | 14.63s | 6.04s |
| Muda wa majibu (jumla) | 35.99s | 18.80s |
Alama dhidi ya gharama ya jumla
Muda wa majibu (wastani)
Alama vs Muda wa majibu (wastani)
Jumla ya tokeni za matokeo
Alama vs Jumla ya tokeni za matokeo
Mgawanyo wa kategoria
Ulinganisho wa haraka
Badilisha jozi ya ulinganisho
DeepSeek V3.2nonevsMercury 2mediumMercury 2mediumvsMiMo-V2-OmninoneMistral Small 4mediumvsGrok 4.20noneMercury 2mediumvsQwen3.5-FlashnoneMercury 2mediumvsGLM 5V TurbononeSeed-2.0-LitenonevsMercury 2mediumMiniMax M2.7mediumvsGrok 4.20noneGemini 2.5 FlashnonevsMercury 2mediumMercury 2mediumvsQwen3.5-35B-A3BnoneMercury 2mediumvsGLM 5noneGemma 4 31BnonevsMercury 2mediumMercury 2mediumvsHunter Alphanone