Urambazaji
AI BENCHY
Linganisha Chati
❤️ Made by XCS
Your ad here

AI BENCHY Compare

Anthropic: Claude Sonnet 4.6 vs OpenAI: gpt-oss-120b

Jina la modeli:

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe : 2026-02-27 15:16

Muhtasari

Kipimo Anthropic: Claude Sonnet 4.6 medium Toleo: Tarehe ya kutolewa haijulikani OpenAI: gpt-oss-120b medium Toleo: Tarehe ya kutolewa haijulikani Inapatikana bure
Nafasi #8 #25
Alama 7.43 5.64
Uthabiti 9.40 7.55
Gharama kwa matokeo 8.105 0.101
Jumla ya gharama $0.811 $0.008
Majaribio sahihi
Majaribio yenye makosa 4 7
Kiwango cha kupita kwa kila jaribio 73.8% 59.5%
Majaribio yasiyo thabiti 1 4
Tokeni za matokeo 29,098 11,407
Tokeni za hoja 20,435 26,106

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 7.00 10.00 66.7% 0 1,031 1,093
OpenAI: gpt-oss-120b 7.00 9.81 66.7% 0 3,463 2,077
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 10.00 10.00 100.0% 0 727 907
OpenAI: gpt-oss-120b 5.50 5.87 66.7% 1 241 1,114
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 1.00 7.21 11.1% 1 25,790 16,919
OpenAI: gpt-oss-120b 1.00 4.41 22.2% 2 6,018 18,520
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 10.00 10.00 100.0% 0 316 523
OpenAI: gpt-oss-120b 10.00 10.00 100.0% 0 120 1,770
Puzzle Solving Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 10.00 10.00 100.0% 0 579 642
OpenAI: gpt-oss-120b 5.00 7.13 44.4% 1 1,278 1,542
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 10.00 10.00 100.0% 0 655 351
OpenAI: gpt-oss-120b 9.00 9.97 100.0% 0 287 1,083

Badilisha jozi ya ulinganisho