Urambazaji
AI BENCHY
Advertise here

AI BENCHY Compare

Modeli zilizolinganishwa

Muhtasari

Ulinganisho wa benchmark Kimi K2.6 vs Kimi K2.5 vs GLM 5 vs Claude Opus 4.7Claude Opus 4.7 inaongoza kwenye Alama kwa 8.7. Kimi K2.6 inaongoza kwenye Uaminifu kwa 10.0. GLM 5 ina Jumla ya gharama ya chini zaidi kwa $0.228. Claude Opus 4.7 ndiyo ya haraka zaidi kwa 4.73s.

Muundo unaopendekezwa: Claude Opus 4.7 - It has the best score here (8.7), while responding about 14.4x faster than miundo mingine katika ulinganisho huu.

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-06-10

Kipimo Kimi K2.6 Kimi K2.6 medium Toleo: 2026-04-20 Inapatikana bure Kimi K2.5 Kimi K2.5 medium Toleo: 2026-01-27 GLM 5 GLM 5 medium Toleo: 2026-02-12 Claude Opus 4.7 Claude Opus 4.7 medium Toleo: 2026-04-16
Alama 7.2 6.8 8.3 8.7
Nafasi #61 #77 #18 #11
Uaminifu 10.0 10.0 10.0 10.0
Uthabiti 8.6 6.9 8.5 9.6
Majaribio sahihi
Kiwango cha kupita kwa kila jaribio 65.1% 68.3% 82.5% 82.5%
Majaribio yasiyo thabiti 3 8 4 1
Jumla ya uendeshaji 63 63 63 63
Gharama kwa matokeo 8.358 3.704 1.668 3.991
Jumla ya gharama $0.889 $0.328 $0.228 $0.679
Bei ya ingizo $0.680 / 1M $0.400 / 1M $0.600 / 1M $5.000 / 1M
Bei ya toleo $3.410 / 1M $1.900 / 1M $1.920 / 1M $25.000 / 1M
Jumla ya tokeni za ingizo 29,450 34,312 35,224 65,406
Tokeni za matokeo 102,923 48,379 21,570 11,858
Tokeni za hoja 254,094 157,747 102,996 2,198
Muda wa majibu (wastani) 71.67s 98.43s 33.54s 4.73s
Muda wa majibu (upeo) 406.78s 281.00s 99.85s 23.18s
Muda wa majibu (jumla) 1433.36s 1378.03s 435.99s 94.51s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#61 MoonshotAI: Kimi K2.6

medium
Cost
$0.013
Time
103.4s
Tokens
3,620 tok

#77 MoonshotAI: Kimi K2.5

medium
Cost
$0.030
Time
58.6s
Tokens
8,683 tok

#18 GLM 5

medium
Cost
$0.005
Time
20.7s
Tokens
2,068 tok

#11 Claude Opus 4.7

medium
Cost
$0.059
Time
26.8s
Tokens
2,475 tok

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Muda wa majibu (wastani)

Alama vs Muda wa majibu (wastani)

Jumla ya tokeni za matokeo

Alama vs Jumla ya tokeni za matokeo

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Kimi K2.6 7.0 8.0 66.7% 1 11.59s 618 7,115 8,934
Kimi K2.5 7.3 5.8 83.3% 2 51.38s 634 2,789 8,880
GLM 5 10.0 10.0 100.0% 0 23.66s 555 480 7,056
Claude Opus 4.7 8.3 10.0 75.0% 0 1.85s 894 348 0
Uandishi wa msimbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Kimi K2.6 5.7 8.6 33.3% 0 214.42s 2,925 9,970 77,189
Kimi K2.5 6.1 4.6 66.7% 2 217.49s 6,935 5,705 74,693
GLM 5 10.0 10.0 100.0% 0 74.30s 7,254 2,997 52,930
Claude Opus 4.7 7.6 7.2 77.8% 1 12.96s 10,635 7,629 1,114
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Kimi K2.6 10.0 10.0 100.0% 0 40.96s 11,271 711 13,876
Kimi K2.5 10.0 10.0 100.0% 0 71.37s 11,280 703 3,713
GLM 5 10.0 10.0 100.0% 0 28.96s 12,804 662 3,242
Claude Opus 4.7 10.0 10.0 100.0% 0 21.45s 24,501 2,369 1,084
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Kimi K2.6 10.0 10.0 100.0% 0 20.38s 7,014 316 11,305
Kimi K2.5 10.0 10.0 100.0% 0 49.78s 7,020 563 7,940
GLM 5 7.1 5.6 83.3% 1 8.90s 5,508 567 3,734
Claude Opus 4.7 10.0 10.0 100.0% 0 2.37s 10,533 324 0
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Kimi K2.6 5.3 7.2 44.4% 1 202.38s 326 47,035 98,262
Kimi K2.5 3.5 4.4 33.3% 2 137.29s 485 20,753 30,564
GLM 5 3.5 4.4 33.3% 2 0ms 260 13,176 14,137
Claude Opus 4.7 7.7 10.0 66.7% 0 1.17s 630 51 0
Akili ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Kimi K2.6 10.0 10.0 100.0% 0 17.83s 477 3,981 4,472
Kimi K2.5 6.5 3.4 66.7% 1 69.73s 480 3,815 4,262
GLM 5 6.1 3.1 66.7% 1 14.69s 477 2,020 2,248
Claude Opus 4.7 10.0 10.0 100.0% 0 2.87s 723 256 0
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Kimi K2.6 10.0 10.0 100.0% 0 12.53s 669 3,977 5,269
Kimi K2.5 10.0 10.0 100.0% 0 92.47s 675 5,371 6,547
GLM 5 10.0 10.0 100.0% 0 7.25s 636 1,001 2,129
Claude Opus 4.7 10.0 10.0 100.0% 0 1.57s 939 114 0
Utatuzi wa mafumbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Kimi K2.6 6.0 7.4 55.6% 1 25.06s 651 13,860 17,599
Kimi K2.5 5.3 7.3 44.4% 1 43.23s 659 8,426 12,692
GLM 5 10.0 10.0 100.0% 0 11.33s 609 33 4,076
Claude Opus 4.7 10.0 10.0 100.0% 0 2.43s 939 370 0
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Kimi K2.6 10.0 10.0 100.0% 0 8.92s 5,286 248 1,011
Kimi K2.5 10.0 10.0 100.0% 0 31.74s 5,933 242 812
GLM 5 10.0 10.0 100.0% 0 15.93s 6,935 233 994
Claude Opus 4.7 10.0 10.0 100.0% 0 4.17s 15,339 373 0
Maarifa ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za ingizo Tokeni za matokeo Tokeni za hoja
Kimi K2.6 3.0 10.0 0.0% 0 130.27s 213 15,710 16,177
Kimi K2.5 3.0 10.0 0.0% 0 83.95s 211 12 7,644
GLM 5 3.0 10.0 0.0% 0 67.37s 186 401 12,450
Claude Opus 4.7 3.0 10.0 0.0% 0 2.25s 273 24 0

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho