Urambazaji
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Inception: Mercury 2 vs OpenAI: GPT-5.3 Chat

Muhtasari

Ulinganisho wa benchmark Mercury 2 vs GPT-5.3 Chat: average score iko karibu sawa: 7.5 vs 7.5. Mercury 2 ina gharama ya chini ya benchmark: $0.058 vs $0.433. Mercury 2 ni ya haraka zaidi: 2.24s vs 6.34s, na pass rates 54.0% vs 66.7%.

Muundo unaopendekezwa: Mercury 2 - It has the best score here (7.5), while costing about 7.5x less than GPT-5.3 Chat.

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-06-12

Kipimo Mercury 2 Mercury 2 medium Toleo: 2026-02-24 GPT-5.3 Chat GPT-5.3 Chat none Toleo: 2026-03-03
Alama 7.5 7.5
Nafasi #46 #47
Uaminifu 10.0 10.0
Uthabiti 8.8 8.1
Majaribio sahihi
Kiwango cha kupita kwa kila jaribio 54.0% 66.7%
Majaribio yasiyo thabiti 3 5
Jumla ya uendeshaji 63 63
Gharama kwa matokeo 0.578 3.605
Jumla ya gharama $0.058 $0.433
Bei ya ingizo $0.250 / 1M $1.750 / 1M
Bei ya toleo $0.750 / 1M $14.000 / 1M
Jumla ya tokeni za ingizo 35,116 34,209
Tokeni za matokeo 4,048 26,617
Tokeni za hoja 61,219 0
Muda wa majibu (wastani) 2.24s 6.34s
Muda wa majibu (upeo) 14.63s 18.33s
Muda wa majibu (jumla) 44.72s 133.13s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#46 Mercury 2

medium
Cost
$0.002
Time
2.1s
Tokens
1,702 tok

#47 GPT-5.3 Chat

none
Cost
$0.008
Time
8.1s
Tokens
634 tok

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Muda wa majibu (wastani)

Alama vs Muda wa majibu (wastani)

Jumla ya tokeni za matokeo

Alama vs Jumla ya tokeni za matokeo

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Mercury 2 6.9 9.9 50.0% 0 1.12s 554 2,546 2,609
GPT-5.3 Chat 6.7 8.1 58.3% 1 3.86s 606 3,167 0
Uandishi wa msimbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Mercury 2 8.2 7.7 77.8% 1 2.04s 7,065 296 11,328
GPT-5.3 Chat 5.6 4.7 55.6% 2 10.52s 7,302 6,632 0
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Mercury 2 10.0 10.0 100.0% 0 3.28s 12,909 268 4,887
GPT-5.3 Chat 10.0 10.0 100.0% 0 11.96s 11,019 2,614 0
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Mercury 2 7.3 5.9 83.3% 1 1.11s 6,234 183 1,656
GPT-5.3 Chat 10.0 10.0 100.0% 0 2.21s 7,140 942 0
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Mercury 2 2.9 7.2 11.1% 1 6.48s 695 41 30,754
GPT-5.3 Chat 3.5 4.4 33.3% 2 13.01s 723 8,264 0
Akili ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Mercury 2 4.8 10.0 0.0% 0 821ms 456 137 542
GPT-5.3 Chat 4.6 10.0 0.0% 0 1.99s 477 319 0
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Mercury 2 10.0 10.0 100.0% 0 1.07s 340 14 958
GPT-5.3 Chat 9.8 10.0 100.0% 0 3.51s 660 1,491 0
Utatuzi wa mafumbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Mercury 2 5.4 10.0 33.3% 0 949ms 601 361 2,781
GPT-5.3 Chat 10.0 10.0 100.0% 0 2.99s 642 1,758 0
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Mercury 2 10.0 10.0 100.0% 0 1.89s 6,080 180 1,956
GPT-5.3 Chat 10.0 10.0 100.0% 0 8.36s 5,445 861 0
Maarifa ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Mercury 2 3.0 10.0 0.0% 0 2.58s 182 22 3,748
GPT-5.3 Chat 3.0 10.0 0.0% 0 4.38s 195 569 0

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho