
AI BENCHY Compare

Inception: Mercury 2 vs OpenAI: GPT-5 Mini


Benchmarks generated from the AI BENCHY test sets on 2026-03-05.

| Metric | Inception: Mercury 2 (none; release: 2026-02-24) | OpenAI: GPT-5 Mini (medium; release: 2025-08-07) |
| --- | --- | --- |
| Rank | #50 | #31 |
| Average score | 3.4 | 6.1 |
| Correct tests | | |
| Consistency | 8.9 | 8.9 |
| Cost per result | 0.147 | 1.401 |
| Total cost | $0.006 | $0.113 |
| Pass rate per attempt | 33.3% | 62.2% |
| Flaky tests | 2 | 2 |
| Total attempts | 45 (15 x 3) | 45 (15 x 3) |
| Output tokens | 1,144 | 5,477 |
| Reasoning tokens | 0 | 46,912 |
| Response time (average) | 594ms | 25.92s |
| Response time (max) | 1.27s | 88.15s |
| Response time (total) | 8.91s | 388.79s |
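
The figures above can be cross-checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions the page does not state: that the pass rate per attempt is passing attempts divided by the 45 (15 x 3) total attempts, and that the average response time is the total response time divided by the 15 tests. The function names are hypothetical.

```python
# Figures copied from the comparison table; the formulas are assumptions.
TESTS = 15
RUNS_PER_TEST = 3
TOTAL_ATTEMPTS = TESTS * RUNS_PER_TEST  # 45, as in "45 (15 x 3)"

models = {
    "Inception: Mercury 2": {"pass_rate_pct": 33.3, "total_time_s": 8.91},
    "OpenAI: GPT-5 Mini": {"pass_rate_pct": 62.2, "total_time_s": 388.79},
}


def implied_passes(pass_rate_pct, attempts=TOTAL_ATTEMPTS):
    """Whole number of passing attempts closest to a rounded pass rate."""
    return round(pass_rate_pct / 100 * attempts)


def mean_time_per_test(total_time_s, tests=TESTS):
    """Average response time, assuming the total is spread over the tests."""
    return total_time_s / tests


for name, figures in models.items():
    passes = implied_passes(figures["pass_rate_pct"])
    mean_s = mean_time_per_test(figures["total_time_s"])
    print(f"{name}: ~{passes}/{TOTAL_ATTEMPTS} attempts passed, "
          f"mean response time ~{mean_s:.3f}s")
```

Under these assumptions the reported numbers are self-consistent: 33.3% and 62.2% correspond to 15 and 28 passing attempts out of 45, and the totals divided by 15 reproduce the reported averages of 594ms and 25.92s.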

[Charts: Top models by score; Response time (average); Score vs. total cost; Average score vs. response time (average)]

Category breakdown

| Category | Model | Score | Consistency | Pass rate per attempt | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI tricks | Inception: Mercury 2 | 10.0 | 10.0 | 0.0% | 0 | | 466ms | 274 | 0 |
| Anti-AI tricks | OpenAI: GPT-5 Mini | 7.0 | 9.6 | 66.7% | 0 | | 16.45s | 1,645 | 5,824 |
| Mixed | Inception: Mercury 2 | 10.0 | 10.0 | 0.0% | 0 | | 606ms | 131 | 0 |
| Mixed | OpenAI: GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | | 88.15s | 754 | 11,520 |
| Data parsing and extraction | Inception: Mercury 2 | 5.5 | 5.9 | 83.3% | 1 | | 667ms | 180 | 0 |
| Data parsing and extraction | OpenAI: GPT-5 Mini | 9.9 | 10.0 | 100.0% | 0 | | 12.58s | 453 | 3,200 |
| Domain-specific | Inception: Mercury 2 | 4.0 | 7.2 | 44.4% | 1 | | 534ms | 46 | 0 |
| Domain-specific | OpenAI: GPT-5 Mini | 10.0 | 7.2 | 22.2% | 1 | | 44.63s | 293 | 14,016 |
| Instruction following | Inception: Mercury 2 | 5.5 | 10.0 | 50.0% | 0 | | 551ms | 82 | 0 |
| Instruction following | OpenAI: GPT-5 Mini | 7.5 | 6.6 | 83.3% | 1 | | 15.66s | 318 | 4,992 |
| Puzzle Solving | Inception: Mercury 2 | 10.0 | 10.0 | 0.0% | 0 | | 533ms | 234 | 0 |
| Puzzle Solving | OpenAI: GPT-5 Mini | 4.3 | 9.8 | 33.3% | 0 | | 14.09s | 1,527 | 5,760 |
| Tool calling | Inception: Mercury 2 | 10.0 | 10.0 | 100.0% | 0 | | 1.27s | 197 | 0 |
| Tool calling | OpenAI: GPT-5 Mini | 10.0 | 10.0 | 100.0% | 0 | | 18.64s | 487 | 1,600 |
