Urambazaji
AI BENCHY
Linganisha Chati
❤️ Made by XCS
Your ad here

AI BENCHY Compare

Anthropic: Claude Sonnet 4.6 vs OpenAI: GPT-5.3 Chat

Linganisha:

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-03-03

Kipimo Anthropic: Claude Sonnet 4.6 medium Toleo: 2026-02-17 OpenAI: GPT-5.3 Chat none Toleo: 2026-03-03
Nafasi #11 #14
Wastani wa alama 7.43 7.27
Uthabiti 9.40 8.26
Gharama kwa matokeo 8.105 2.835
Jumla ya gharama $0.811 $0.256
Majaribio sahihi
Kiwango cha kupita kwa kila jaribio 73.8% 73.8%
Majaribio yasiyo thabiti 1 3
Tokeni za matokeo 29,098 16,339
Tokeni za hoja 20,435 0

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 7.00 10.00 66.7% 0 1,031 1,093
OpenAI: GPT-5.3 Chat 7.33 7.49 77.8% 1 3,091 0
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 10.00 10.00 100.0% 0 727 907
OpenAI: GPT-5.3 Chat 9.88 10.00 100.0% 0 942 0
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 1.00 7.21 11.1% 1 25,790 16,919
OpenAI: GPT-5.3 Chat 1.00 4.41 33.3% 2 8,264 0
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 10.00 10.00 100.0% 0 316 523
OpenAI: GPT-5.3 Chat 8.50 9.99 50.0% 0 1,455 0
Puzzle Solving Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 10.00 10.00 100.0% 0 579 642
OpenAI: GPT-5.3 Chat 10.00 10.00 100.0% 0 1,726 0
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
Anthropic: Claude Sonnet 4.6 10.00 10.00 100.0% 0 655 351
OpenAI: GPT-5.3 Chat 10.00 10.00 100.0% 0 861 0

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho