AI BENCHY Compare
Inception: Mercury 2 vs OpenAI: GPT-5.4
Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-04-16
| Kipimo | Mercury 2 Mercury 2 medium | GPT-5.4 GPT-5.4 none |
|---|---|---|
| Alama | 6.5 | 5.9 |
| Nafasi | #53 | #65 |
| Uthabiti | 8.6 | 9.1 |
| Majaribio sahihi | ||
| Kiwango cha kupita kwa kila jaribio | 53.7% | 42.6% |
| Majaribio yasiyo thabiti | 3 | 2 |
| Jumla ya uendeshaji | 54 | 54 |
| Gharama kwa matokeo | 0.580 | 1.477 |
| Jumla ya gharama | $0.047 | $0.104 |
| Bei ya ingizo | $0.250 / 1M | $2.500 / 1M |
| Bei ya toleo | $0.750 / 1M | $15.000 / 1M |
| Tokeni za matokeo | 3,972 | 2,317 |
| Tokeni za hoja | 48,333 | 0 |
| Muda wa majibu (wastani) | 2.21s | 1.51s |
| Muda wa majibu (upeo) | 14.63s | 2.95s |
| Muda wa majibu (jumla) | 37.51s | 27.21s |
Alama dhidi ya gharama ya jumla
Muda wa majibu (wastani)
Alama vs Muda wa majibu (wastani)
Jumla ya tokeni za matokeo
Alama vs Jumla ya tokeni za matokeo
Mgawanyo wa kategoria
Ulinganisho wa haraka
Badilisha jozi ya ulinganisho
Mercury 2mediumvsMiMo-V2-OmninoneMercury 2mediumvsGLM 5noneMiniMax M2.5mediumInapatikana burevsGPT-5.4noneMistral Small 4mediumvsGPT-5.4noneMercury 2mediumvsQwen3.5 Plus 2026-02-15noneMercury 2mediumvsGLM 5V TurbononeMercury 2mediumvsQwen3.5-FlashnoneGemma 4 26B A4BnoneInapatikana burevsMercury 2mediumSeed-2.0-LitenonevsMercury 2mediumGemini 2.5 FlashnonevsMercury 2mediumMercury 2mediumvsQwen3.5-35B-A3BnoneDeepSeek V3.2nonevsMercury 2medium