❤️ Made by XCS

AI BENCHY Compare

Inception: Mercury 2 vs xAI: Grok 4.1 Fast


Benchmarks generated from the AI BENCHY test suites on 2026-03-05.

| Metric | Inception: Mercury 2, medium (release: 2026-02-24) | xAI: Grok 4.1 Fast, none (release: 2025-11-19) |
| --- | --- | --- |
| Average score | 5.4 | 2.9 |
| Correct tests | | |
| Rank | #35 | #53 |
| Consistency | 8.3 | 8.9 |
| Cost per result | 0.622 | 0.239 |
| Total cost | $0.044 | $0.008 |
| Pass rate per attempt | 57.8% | 26.7% |
| Flaky tests | 3 | 2 |
| Total attempts | 45 (15 x 3) | 45 (15 x 3) |
| Output tokens | 3,571 | 1,036 |
| Reasoning tokens | 45,379 | 0 |
| Response time (average) | 2.47s | 2.01s |
| Response time (max) | 14.63s | 5.51s |
| Response time (total) | 34.56s | 16.06s |
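
The pass rates and the attempt count in the table above can be cross-checked with a little arithmetic. The sketch below assumes that the pass rate per attempt is simply passed attempts divided by total attempts (15 tests x 3 attempts = 45); the page does not state this formula, and the helper names are hypothetical.

```python
# Assumption: pass rate per attempt = passed attempts / total attempts.
# The page reports 45 attempts (15 tests x 3 attempts each) for both models.
TOTAL_ATTEMPTS = 15 * 3


def implied_passes(pass_rate_percent: float, total_attempts: int = TOTAL_ATTEMPTS) -> int:
    """Number of passed attempts implied by a reported pass rate."""
    return round(pass_rate_percent / 100 * total_attempts)


def pass_rate(passes: int, total_attempts: int = TOTAL_ATTEMPTS) -> float:
    """Pass rate in percent, rounded to one decimal as on the page."""
    return round(100 * passes / total_attempts, 1)


mercury_passes = implied_passes(57.8)  # Inception: Mercury 2
grok_passes = implied_passes(26.7)     # xAI: Grok 4.1 Fast

print(mercury_passes, pass_rate(mercury_passes))
print(grok_passes, pass_rate(grok_passes))
```

Under that assumption, the reported rates round-trip cleanly to whole numbers of passed attempts out of 45, which suggests the percentages are counted per attempt rather than per test.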

[Charts: Top models by score; Score vs. total cost; Response time (average); Average score vs. response time (average)]

Category breakdown

| Anti-AI techniques | Score | Consistency | Pass rate per attempt | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Inception: Mercury 2 | 7.3 | 9.8 | 66.7% | 0 | | 1.30s | 2,531 | 2,410 |
| xAI: Grok 4.1 Fast | 1.3 | 10.0 | 0.0% | 0 | | 1.73s | 229 | 0 |

| Combination | Score | Consistency | Pass rate per attempt | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Inception: Mercury 2 | 10.0 | 10.0 | 100.0% | 0 | | 3.28s | 268 | 4,887 |
| xAI: Grok 4.1 Fast | 10.0 | 10.0 | 0.0% | 0 | | 3.33s | 105 | 0 |

| Data parsing and extraction | Score | Consistency | Pass rate per attempt | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Inception: Mercury 2 | 5.5 | 5.9 | 83.3% | 1 | | 1.11s | 183 | 1,656 |
| xAI: Grok 4.1 Fast | 9.9 | 10.0 | 100.0% | 0 | | 943ms | 180 | 0 |

| Domain-specific | Score | Consistency | Pass rate per attempt | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Inception: Mercury 2 | 10.0 | 7.2 | 11.1% | 1 | | 6.48s | 41 | 30,754 |
| xAI: Grok 4.1 Fast | 4.0 | 7.2 | 55.6% | 1 | | 1.06s | 15 | 0 |

| Instruction following | Score | Consistency | Pass rate per attempt | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Inception: Mercury 2 | 10.0 | 10.0 | 100.0% | 0 | | 1.07s | 14 | 958 |
| xAI: Grok 4.1 Fast | 10.0 | 10.0 | 0.0% | 0 | | 923ms | 56 | 0 |

| Puzzle Solving | Score | Consistency | Pass rate per attempt | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Inception: Mercury 2 | 1.7 | 7.5 | 22.2% | 1 | | 934ms | 354 | 2,758 |
| xAI: Grok 4.1 Fast | 1.3 | 10.0 | 0.0% | 0 | | 1.28s | 243 | 0 |

| Tool calling | Score | Consistency | Pass rate per attempt | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Inception: Mercury 2 | 10.0 | 10.0 | 100.0% | 0 | | 1.89s | 180 | 1,956 |
| xAI: Grok 4.1 Fast | 10.0 | 1.6 | 33.3% | 1 | | 5.51s | 208 | 0 |
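
As a reading aid, the per-category scores above can be reduced to a simple "who leads where" summary. The sketch below only compares the Score column as transcribed from this page, with category names given in English; the `leader` helper and the tie rule (exactly equal scores count as a tie) are illustrative assumptions, not part of AI BENCHY.

```python
# Scores transcribed from the category breakdown:
# (Inception: Mercury 2, xAI: Grok 4.1 Fast)
CATEGORY_SCORES = {
    "Anti-AI techniques": (7.3, 1.3),
    "Combination": (10.0, 10.0),
    "Data parsing and extraction": (5.5, 9.9),
    "Domain-specific": (10.0, 4.0),
    "Instruction following": (10.0, 10.0),
    "Puzzle Solving": (1.7, 1.3),
    "Tool calling": (10.0, 10.0),
}


def leader(mercury: float, grok: float) -> str:
    """Return the model with the higher score, or 'tie' if they are equal."""
    if mercury > grok:
        return "Inception: Mercury 2"
    if grok > mercury:
        return "xAI: Grok 4.1 Fast"
    return "tie"


leaders = {name: leader(*scores) for name, scores in CATEGORY_SCORES.items()}

for name, who in leaders.items():
    print(f"{name}: {who}")
```

Scores alone hide differences that the other columns expose: several categories tie at 10.0 on score while the reported pass rates and consistency values differ, so the full rows are worth reading alongside this summary.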
