Urambazaji
AI BENCHY
Linganisha Chati
❤️ Made by XCS
Your ad here

AI BENCHY Compare

Anthropic: Claude Opus 4.6 vs OpenAI: GPT-5 Nano

Jina la modeli:

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe : 2026-02-27 15:16

Muhtasari

Kipimo Anthropic: Claude Opus 4.6 medium Toleo: Tarehe ya kutolewa haijulikani OpenAI: GPT-5 Nano medium Toleo: Tarehe ya kutolewa haijulikani
Nafasi #20 #23
Alama 6.07 5.86
Uthabiti 8.80 6.60
Gharama kwa matokeo 10.992 0.519
Jumla ya gharama $0.880 $0.037
Majaribio sahihi
Majaribio yenye makosa 6 7
Kiwango cha kupita kwa kila jaribio 61.9% 69.1%
Majaribio yasiyo thabiti 2 6
Tokeni za matokeo 19,576 3,700
Tokeni za hoja 11,081 85,184

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Opus 4.6 4.00 4.41 55.6% 2 897 1,000
OpenAI: GPT-5 Nano 7.00 9.99 66.7% 0 1,107 19,968
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Opus 4.6 10.00 10.00 100.0% 0 668 763
OpenAI: GPT-5 Nano 5.50 5.81 83.3% 1 426 8,576
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Opus 4.6 1.00 10.00 0.0% 0 16,328 7,928
OpenAI: GPT-5 Nano 4.00 4.41 55.6% 2 195 33,600
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Opus 4.6 9.50 9.99 100.0% 0 266 468
OpenAI: GPT-5 Nano 7.00 6.41 83.3% 1 360 4,032
Puzzle Solving Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Opus 4.6 7.00 10.00 66.7% 0 556 593
OpenAI: GPT-5 Nano 4.67 4.90 55.6% 2 1,054 12,032
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Opus 4.6 10.00 10.00 100.0% 0 861 329
OpenAI: GPT-5 Nano 10.00 10.00 100.0% 0 558 6,976

Badilisha jozi ya ulinganisho