Urambazaji
AI BENCHY
Linganisha Chati
❤️ Made by XCS
Your ad here

AI BENCHY Compare

Inception: Mercury 2 vs OpenAI: gpt-oss-120b

Linganisha:

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-03-05

Kipimo Inception: Mercury 2 none Toleo: 2026-02-24 OpenAI: gpt-oss-120b medium Toleo: 2025-08-05 Inapatikana bure
Wastani wa alama 3.4 5.2
Majaribio sahihi
Nafasi #50 #36
Uthabiti 8.9 7.2
Gharama kwa matokeo 0.147 0.133
Jumla ya gharama $0.006 $0.010
Kiwango cha kupita kwa kila jaribio 33.3% 57.8%
Majaribio yasiyo thabiti 2 5
common.totalAttempts 45 (15 x 3) 45 (15 x 3)
Tokeni za matokeo 1,144 13,103
Tokeni za hoja 0 33,843
Muda wa majibu (wastani) 594ms 17.75s
Muda wa majibu (upeo) 1.27s 50.92s
Muda wa majibu (jumla) 8.91s 141.98s

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Muda wa majibu (wastani)

Wastani wa alama vs Muda wa majibu (wastani)

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Inception: Mercury 2 10.0 10.0 0.0% 0 466ms 274 0
OpenAI: gpt-oss-120b 7.0 9.8 66.7% 0 19.76s 3,463 2,077
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Inception: Mercury 2 10.0 10.0 0.0% 0 606ms 131 0
OpenAI: gpt-oss-120b 10.0 10.0 100.0% 0 31.18s 694 5,072
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Inception: Mercury 2 5.5 5.9 83.3% 1 667ms 180 0
OpenAI: gpt-oss-120b 5.5 5.9 66.7% 1 1.98s 241 1,114
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Inception: Mercury 2 4.0 7.2 44.4% 1 534ms 46 0
OpenAI: gpt-oss-120b 10.0 4.4 22.2% 2 50.92s 6,784 20,606
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Inception: Mercury 2 5.5 10.0 50.0% 0 551ms 82 0
OpenAI: gpt-oss-120b 9.5 10.0 100.0% 0 7.63s 126 1,799
Puzzle Solving Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Inception: Mercury 2 10.0 10.0 0.0% 0 533ms 234 0
OpenAI: gpt-oss-120b 1.7 4.7 22.2% 2 11.80s 1,508 2,092
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Inception: Mercury 2 10.0 10.0 100.0% 0 1.27s 197 0
OpenAI: gpt-oss-120b 9.0 10.0 100.0% 0 6.91s 287 1,083

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho