❤️ Made by XCS

AI BENCHY Compare

OpenAI: GPT-5 Nano vs StepFun: Step 3.5 Flash


Benchmarks generated from the AI BENCHY test suites on 2026-02-27 15:16

Summary

| Metric | OpenAI: GPT-5 Nano (medium) | StepFun: Step 3.5 Flash (medium) |
| --- | --- | --- |
| Version | Release date unknown | Release date unknown; available for free |
| Rank | #23 | #11 |
| Score | 5.86 | 7.00 |
| Stability | 6.60 | 8.32 |
| Cost per result | 0.519 | 0.000 |
| Total cost | $0.037 | $0.000 |
| Correct tests | | |
| Tests with errors | 7 | 5 |
| Per-test pass rate | 69.1% | 73.8% |
| Unstable tests | 6 | 3 |
| Output tokens | 3,700 | 60,502 |
| Reasoning tokens | 85,184 | 117,044 |

Category breakdown

| Category | Model | Score | Stability | Per-test pass rate | Unstable tests | Correct tests | Output tokens | Reasoning tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI techniques | OpenAI: GPT-5 Nano | 7.00 | 9.99 | 66.7% | 0 | | 1,107 | 19,968 |
| Anti-AI techniques | StepFun: Step 3.5 Flash | 10.00 | 10.00 | 100.0% | 0 | | 13,924 | 17,208 |
| Data parsing and extraction | OpenAI: GPT-5 Nano | 5.50 | 5.81 | 83.3% | 1 | | 426 | 8,576 |
| Data parsing and extraction | StepFun: Step 3.5 Flash | 10.00 | 10.00 | 100.0% | 0 | | 535 | 11,548 |
| Domain-specific | OpenAI: GPT-5 Nano | 4.00 | 4.41 | 55.6% | 2 | | 195 | 33,600 |
| Domain-specific | StepFun: Step 3.5 Flash | 4.00 | 7.21 | 44.4% | 1 | | 40,942 | 74,237 |
| Instruction following | OpenAI: GPT-5 Nano | 7.00 | 6.41 | 83.3% | 1 | | 360 | 4,032 |
| Instruction following | StepFun: Step 3.5 Flash | 10.00 | 10.00 | 100.0% | 0 | | 2,121 | 3,274 |
| Puzzle Solving | OpenAI: GPT-5 Nano | 4.67 | 4.90 | 55.6% | 2 | | 1,054 | 12,032 |
| Puzzle Solving | StepFun: Step 3.5 Flash | 2.00 | 4.96 | 33.3% | 2 | | 2,705 | 6,975 |
| Tool calling | OpenAI: GPT-5 Nano | 10.00 | 10.00 | 100.0% | 0 | | 558 | 6,976 |
| Tool calling | StepFun: Step 3.5 Flash | 10.00 | 10.00 | 100.0% | 0 | | 275 | 3,802 |
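
As a quick sanity check, the per-category token counts can be summed and compared against the totals reported in the summary. The sketch below is illustrative only: the names `category_tokens`, `summary_tokens`, and `sum_tokens` are assumptions made for this example, and the numbers are copied by hand from the figures on this page.

```python
# Per-category (output tokens, reasoning tokens), copied from the category figures.
# Category order: anti-AI techniques, data parsing and extraction, domain-specific,
# instruction following, puzzle solving, tool calling.
category_tokens = {
    "OpenAI: GPT-5 Nano": [
        (1107, 19968), (426, 8576), (195, 33600),
        (360, 4032), (1054, 12032), (558, 6976),
    ],
    "StepFun: Step 3.5 Flash": [
        (13924, 17208), (535, 11548), (40942, 74237),
        (2121, 3274), (2705, 6975), (275, 3802),
    ],
}

# Totals as reported in the summary: (output tokens, reasoning tokens).
summary_tokens = {
    "OpenAI: GPT-5 Nano": (3700, 85184),
    "StepFun: Step 3.5 Flash": (60502, 117044),
}


def sum_tokens(rows):
    """Return (total output tokens, total reasoning tokens) for a list of rows."""
    output_total = sum(output for output, _ in rows)
    reasoning_total = sum(reasoning for _, reasoning in rows)
    return output_total, reasoning_total


for model, rows in category_tokens.items():
    totals = sum_tokens(rows)
    # Print the computed totals and whether they match the summary figures.
    print(model, totals, totals == summary_tokens[model])
```

For both models the category counts add up to the summary totals, which suggests the six categories listed here cover the whole test run.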
