AI BENCHY Compare

DeepSeek: DeepSeek V3.2 vs MoonshotAI: Kimi K2.5

Benchmarks generated from the AI BENCHY test sets on 2026-04-16

| Metric | DeepSeek V3.2 (none) | Kimi K2.5 (medium) |
|---|---|---|
| Release | 2025-12-01 | 2026-01-27 |
| Score | 6.1 | 7.0 |
| Rank | #63 | #45 |
| Consistency | 8.1 | 6.8 |
| Correct tests | | |
| Pass rate per test | 50.0% | 72.2% |
| Flaky tests | 4 | 7 |
| Total runs | 54 | 54 |
| Cost per result | 0.226 | 2.444 |
| Total cost | $0.016 | $0.220 |
| Input price | $0.260 / 1M | $0.383 / 1M |
| Output price | $0.380 / 1M | $1.720 / 1M |
| Output tokens | 8,384 | 42,176 |
| Reasoning tokens | 0 | 84,870 |
| Response time (average) | 12.09s | 72.43s |
| Response time (max) | 115.89s | 150.77s |
| Response time (total) | 217.56s | 796.70s |
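
As a rough cross-check of the cost figures above, the sketch below estimates how much of each model's total cost could be attributed to generated tokens. It assumes, and the source does not confirm, that reasoning tokens are billed at the listed output price and that prices are quoted per 1M tokens. The names `MODELS` and `generated_token_cost` are illustrative only, and this is not presented as AI BENCHY's own cost formula.

```python
# Figures copied from the summary table; nothing here is measured independently.
MODELS = {
    "DeepSeek V3.2": {
        "output_price_per_m": 0.380,  # USD per 1M output tokens
        "output_tokens": 8_384,
        "reasoning_tokens": 0,
        "total_cost": 0.016,          # USD, as reported
    },
    "Kimi K2.5": {
        "output_price_per_m": 1.720,
        "output_tokens": 42_176,
        "reasoning_tokens": 84_870,
        "total_cost": 0.220,
    },
}


def generated_token_cost(model: dict) -> float:
    """Estimate USD spent on generated tokens.

    ASSUMPTION: reasoning tokens are billed at the output price.
    """
    tokens = model["output_tokens"] + model["reasoning_tokens"]
    return tokens * model["output_price_per_m"] / 1_000_000


for name, model in MODELS.items():
    estimate = generated_token_cost(model)
    share = estimate / model["total_cost"]
    print(f"{name}: ~${estimate:.4f} of ${model['total_cost']:.3f} reported ({share:.0%})")
```

Under that assumption, generated tokens would account for nearly all of Kimi K2.5's reported cost but only a small part of DeepSeek V3.2's; the remainder would presumably come from input tokens, which the table does not list.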

[Charts: Top models by score; Score vs. total cost; Response time (average); Score vs. response time (average); Total output tokens; Score vs. total output tokens]

Category breakdown

| Category | Model | Score | Consistency | Pass rate per test | Flaky tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
|---|---|---|---|---|---|---|---|---|---|
| Anti-AI tricks | DeepSeek V3.2 | 3.2 | 9.8 | 0.0% | 0 | | 7.63s | 1,419 | 0 |
| Anti-AI tricks | Kimi K2.5 | 7.3 | 5.8 | 83.3% | 2 | | 51.38s | 2,789 | 8,880 |
| Coding | DeepSeek V3.2 | 2.4 | 1.3 | 33.3% | 1 | | 7.63s | 553 | 0 |
| Coding | Kimi K2.5 | 4.7 | 1.6 | 66.7% | 1 | | 150.77s | 1,269 | 9,749 |
| Mixed | DeepSeek V3.2 | 6.5 | 10.0 | 0.0% | 0 | | 115.89s | 2,887 | 0 |
| Mixed | Kimi K2.5 | 10.0 | 10.0 | 100.0% | 0 | | 71.37s | 703 | 3,713 |
| Data parsing and extraction | DeepSeek V3.2 | 6.3 | 5.8 | 66.7% | 1 | | 9.42s | 1,710 | 0 |
| Data parsing and extraction | Kimi K2.5 | 10.0 | 10.0 | 100.0% | 0 | | 49.78s | 563 | 7,940 |
| Domain-specific | DeepSeek V3.2 | 3.6 | 7.2 | 22.2% | 1 | | 1.61s | 24 | 0 |
| Domain-specific | Kimi K2.5 | 3.5 | 4.4 | 33.3% | 2 | | 137.29s | 20,753 | 30,564 |
| General intelligence | DeepSeek V3.2 | 10.0 | 10.0 | 100.0% | 0 | | 2.86s | 67 | 0 |
| General intelligence | Kimi K2.5 | 6.5 | 3.4 | 66.7% | 1 | | 69.73s | 3,815 | 4,262 |
| Instruction following | DeepSeek V3.2 | 10.0 | 10.0 | 100.0% | 0 | | 1.52s | 66 | 0 |
| Instruction following | Kimi K2.5 | 10.0 | 10.0 | 100.0% | 0 | | 92.47s | 5,371 | 6,547 |
| Puzzle solving | DeepSeek V3.2 | 8.5 | 7.5 | 88.9% | 1 | | 7.37s | 1,136 | 0 |
| Puzzle solving | Kimi K2.5 | 5.3 | 7.3 | 44.4% | 1 | | 45.40s | 6,671 | 12,403 |
| Tool calling | DeepSeek V3.2 | 10.0 | 10.0 | 100.0% | 0 | | 11.85s | 522 | 0 |
| Tool calling | Kimi K2.5 | 10.0 | 10.0 | 100.0% | 0 | | 31.74s | 242 | 812 |
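
To make the per-category figures easier to compare, the sketch below tallies which model scores higher in each category and checks that the per-category token counts add up to the overall totals. The data is copied from the category table; the English category labels and the helper names `tally_wins` and `token_total` are illustrative choices, not part of AI BENCHY.

```python
from collections import Counter

# Each row: category, then (score, output tokens, reasoning tokens)
# for DeepSeek V3.2 and for Kimi K2.5, copied from the category table.
ROWS = [
    ("Anti-AI tricks",              (3.2, 1_419, 0),  (7.3, 2_789, 8_880)),
    ("Coding",                      (2.4, 553, 0),    (4.7, 1_269, 9_749)),
    ("Mixed",                       (6.5, 2_887, 0),  (10.0, 703, 3_713)),
    ("Data parsing and extraction", (6.3, 1_710, 0),  (10.0, 563, 7_940)),
    ("Domain-specific",             (3.6, 24, 0),     (3.5, 20_753, 30_564)),
    ("General intelligence",        (10.0, 67, 0),    (6.5, 3_815, 4_262)),
    ("Instruction following",       (10.0, 66, 0),    (10.0, 5_371, 6_547)),
    ("Puzzle solving",              (8.5, 1_136, 0),  (5.3, 6_671, 12_403)),
    ("Tool calling",                (10.0, 522, 0),   (10.0, 242, 812)),
]


def tally_wins(rows) -> Counter:
    """Count categories where each model has the strictly higher score."""
    wins = Counter()
    for _category, deepseek, kimi in rows:
        if deepseek[0] > kimi[0]:
            wins["DeepSeek V3.2"] += 1
        elif kimi[0] > deepseek[0]:
            wins["Kimi K2.5"] += 1
        else:
            wins["tie"] += 1
    return wins


def token_total(rows, model_index: int, field_index: int) -> int:
    """Sum one token field (1 = output, 2 = reasoning) for one model (1 = DeepSeek, 2 = Kimi)."""
    return sum(row[model_index][field_index] for row in rows)


print(dict(tally_wins(ROWS)))
print("DeepSeek V3.2 output tokens:", token_total(ROWS, 1, 1))
print("Kimi K2.5 output tokens:", token_total(ROWS, 2, 1))
print("Kimi K2.5 reasoning tokens:", token_total(ROWS, 2, 2))
```

With the figures as listed, Kimi K2.5 leads in four categories, DeepSeek V3.2 in three, and two are tied. The per-category token counts add up to the overall totals: 8,384 output tokens for DeepSeek V3.2, and 42,176 output and 84,870 reasoning tokens for Kimi K2.5.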
