Urambazaji
AI BENCHY
Linganisha Chati
❤️ Made by XCS
Your ad here

AI BENCHY Compare

MoonshotAI: Kimi K2.5 vs OpenAI: GPT-5.3-Codex

Jina la modeli:

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe : 2026-02-27 15:16

Muhtasari

Kipimo MoonshotAI: Kimi K2.5 medium Toleo: Tarehe ya kutolewa haijulikani OpenAI: GPT-5.3-Codex medium Toleo: Tarehe ya kutolewa haijulikani
Nafasi #17 #7
Alama 6.29 7.93
Uthabiti 7.69 8.84
Gharama kwa matokeo 2.335 4.641
Jumla ya gharama $0.187 $0.465
Majaribio sahihi
Majaribio yenye makosa 6 4
Kiwango cha kupita kwa kila jaribio 73.8% 78.6%
Majaribio yasiyo thabiti 4 2
Tokeni za matokeo 30,504 1,201
Tokeni za hoja 58,467 30,056

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
MoonshotAI: Kimi K2.5 7.00 7.21 88.9% 1 335 6,255
OpenAI: GPT-5.3-Codex 10.00 10.00 100.0% 0 216 1,421
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
MoonshotAI: Kimi K2.5 10.00 10.00 100.0% 0 1,181 6,049
OpenAI: GPT-5.3-Codex 10.00 10.00 100.0% 0 234 735
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
MoonshotAI: Kimi K2.5 1.00 4.41 33.3% 2 20,696 30,894
OpenAI: GPT-5.3-Codex 4.00 7.21 55.6% 1 64 25,308
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
MoonshotAI: Kimi K2.5 9.50 10.00 100.0% 0 3,777 4,967
OpenAI: GPT-5.3-Codex 9.00 10.00 50.0% 0 93 693
Puzzle Solving Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
MoonshotAI: Kimi K2.5 5.00 7.61 55.6% 1 4,273 9,490
OpenAI: GPT-5.3-Codex 7.00 7.38 77.8% 1 340 1,407
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Tokeni za matokeo Tokeni za hoja
MoonshotAI: Kimi K2.5 10.00 10.00 100.0% 0 242 812
OpenAI: GPT-5.3-Codex 10.00 10.00 100.0% 0 254 492

Badilisha jozi ya ulinganisho