Urambazaji
AI BENCHY
Linganisha Chati
❤️ Made by XCS
Your ad here

AI BENCHY Compare

OpenAI: GPT-5.4 vs StepFun: Step 3.5 Flash

Linganisha:

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-03-05

Kipimo OpenAI: GPT-5.4 none Toleo: 2026-03-05 StepFun: Step 3.5 Flash medium Toleo: 2026-02-01 Inapatikana bure
Nafasi #44 #16
Wastani wa alama 4.6 7.5
Majaribio sahihi
Uthabiti 8.9 9.0
Gharama kwa matokeo 1.496 0.000
Jumla ya gharama $0.090 $0.000
Kiwango cha kupita kwa kila jaribio 44.4% 73.3%
Majaribio yasiyo thabiti 2 2
common.totalAttempts 45 (15 x 3) 45 (15 x 3)
Tokeni za matokeo 1,635 69,238
Tokeni za hoja 0 152,563
Muda wa majibu (wastani) 1.46s 31.60s
Muda wa majibu (upeo) 2.89s 170.45s
Muda wa majibu (jumla) 21.86s 284.43s

Modeli bora kwa alama

Muda wa majibu (wastani)

Alama dhidi ya gharama ya jumla

Wastani wa alama vs Muda wa majibu (wastani)

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 10.0 7.3 11.1% 1 1.41s 388 0
StepFun: Step 3.5 Flash 10.0 10.0 100.0% 0 18.54s 13,924 17,208
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 10.0 10.0 0.0% 0 2.89s 291 0
StepFun: Step 3.5 Flash 10.0 10.0 100.0% 0 29.57s 1,176 12,984
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 9.9 10.0 100.0% 0 1.04s 222 0
StepFun: Step 3.5 Flash 10.0 10.0 100.0% 0 15.01s 600 13,886
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 4.0 7.2 44.4% 1 1.07s 50 0
StepFun: Step 3.5 Flash 4.0 7.2 44.4% 1 170.45s 45,350 90,436
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 5.5 10.0 50.0% 0 1.07s 81 0
StepFun: Step 3.5 Flash 9.0 6.8 83.3% 1 4.98s 2,284 3,412
Puzzle Solving Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 4.0 9.8 33.3% 0 1.52s 357 0
StepFun: Step 3.5 Flash 4.0 10.0 33.3% 0 7.72s 5,629 10,835
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
OpenAI: GPT-5.4 10.0 10.0 100.0% 0 2.75s 246 0
StepFun: Step 3.5 Flash 10.0 10.0 100.0% 0 11.91s 275 3,802

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho