
AI BENCHY Compare

Trinity Large Preview vs MoonshotAI: Kimi K2.5

Benchmarks generated from the AI BENCHY test sets on 2026-03-06.

| Metric | Trinity Large Preview (none) | MoonshotAI: Kimi K2.5 (none) |
|---|---|---|
| Release | 2026-01-27 (available for free) | 2026-01-27 |
| Rank | #45 | #46 |
| Average score | 4.2 | 4.1 |
| Consistency | 9.6 | 8.6 |
| Cost per result | 0.000 | 0.295 |
| Total cost | $0.000 | $0.015 |
| Correct tests | | |
| Pass rate per test | 33.3% | 39.6% |
| Unstable tests | 1 | 3 |
| Total runs | 48 (16 x 3) | 48 (16 x 3) |
| Output tokens | 1,837 | 2,000 |
| Reasoning tokens | 0 | 0 |
| Response time (average) | 3.15s | 11.91s |
| Response time (max) | 8.91s | 42.13s |
| Response time (total) | 50.46s | 107.16s |
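The headline figures above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that "16 x 3" means 16 tests each repeated 3 times, and the dictionary keys and helper names (`total_runs`, `latency_ratio`) are hypothetical, not part of AI BENCHY.

```python
# Summary values copied from the comparison table.
# Key names are illustrative, not AI BENCHY identifiers.
summary = {
    "Trinity Large Preview": {"avg_latency_s": 3.15, "max_latency_s": 8.91},
    "MoonshotAI: Kimi K2.5": {"avg_latency_s": 11.91, "max_latency_s": 42.13},
}


def total_runs(tests: int, repeats: int) -> int:
    """Total runs, assuming every test is executed `repeats` times."""
    return tests * repeats


def latency_ratio(slower: float, faster: float) -> float:
    """How many times longer the slower response time is."""
    return slower / faster


trinity = summary["Trinity Large Preview"]
kimi = summary["MoonshotAI: Kimi K2.5"]

runs = total_runs(16, 3)
avg_ratio = latency_ratio(kimi["avg_latency_s"], trinity["avg_latency_s"])
max_ratio = latency_ratio(kimi["max_latency_s"], trinity["max_latency_s"])

print(f"total runs: {runs}")
print(f"average response time ratio (Kimi / Trinity): {avg_ratio:.2f}")
print(f"max response time ratio (Kimi / Trinity): {max_ratio:.2f}")
```

Under these assumptions the run count matches the reported 48, and Kimi K2.5's average response time comes out at roughly 3.8 times that of Trinity Large Preview.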

[Charts: Top models by score; Score vs. total cost; Response time (average); Average score vs. response time (average)]

Category breakdown

| Category | Model | Score | Consistency | Pass rate per test | Unstable tests | Correct tests | Response time (average) | Output tokens | Reasoning tokens |
|---|---|---|---|---|---|---|---|---|---|
| Anti-AI tricks | Trinity Large Preview | 10.0 | 10.0 | 0.0% | 0 | | 3.59s | 587 | 0 |
| Anti-AI tricks | MoonshotAI: Kimi K2.5 | 2.7 | 7.9 | 11.1% | 1 | | 11.38s | 363 | 0 |
| Combination | Trinity Large Preview | 10.0 | 10.0 | 0.0% | 0 | | 8.91s | 294 | 0 |
| Combination | MoonshotAI: Kimi K2.5 | 10.0 | 2.1 | 33.3% | 1 | | 19.16s | 748 | 0 |
| Data parsing and extraction | Trinity Large Preview | 9.9 | 10.0 | 100.0% | 0 | | 3.26s | 186 | 0 |
| Data parsing and extraction | MoonshotAI: Kimi K2.5 | 5.4 | 5.8 | 83.3% | 1 | | 42.13s | 187 | 0 |
| Domain specific | Trinity Large Preview | 4.0 | 10.0 | 33.3% | 0 | | 877ms | 25 | 0 |
| Domain specific | MoonshotAI: Kimi K2.5 | 4.0 | 10.0 | 33.3% | 0 | | 4.38s | 29 | 0 |
| General intelligence | Trinity Large Preview | 3.0 | 9.9 | 0.0% | 0 | | 2.86s | 124 | 0 |
| General intelligence | MoonshotAI: Kimi K2.5 | 10.0 | 10.0 | 100.0% | 0 | | 4.00s | 76 | 0 |
| Instruction following | Trinity Large Preview | 3.5 | 6.7 | 16.7% | 1 | | 1.09s | 63 | 0 |
| Instruction following | MoonshotAI: Kimi K2.5 | 5.5 | 10.0 | 50.0% | 0 | | 2.67s | 60 | 0 |
| Puzzle Solving | Trinity Large Preview | 4.0 | 10.0 | 33.3% | 0 | | 3.30s | 291 | 0 |
| Puzzle Solving | MoonshotAI: Kimi K2.5 | 10.0 | 10.0 | 0.0% | 0 | | 4.73s | 317 | 0 |
| Tool calling | Trinity Large Preview | 10.0 | 10.0 | 100.0% | 0 | | 6.67s | 267 | 0 |
| Tool calling | MoonshotAI: Kimi K2.5 | 10.0 | 10.0 | 100.0% | 0 | | 13.99s | 220 | 0 |
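One way to read the per-category scores above is to tally which model leads in each category. The sketch below is a minimal illustration using the score column only. The English category labels, the `category_scores` structure, and the `tally_leaders` helper are hypothetical, and the tally is not a metric that AI BENCHY itself reports.

```python
# Per-category scores copied from the breakdown:
# (Trinity Large Preview, MoonshotAI: Kimi K2.5).
category_scores = {
    "Anti-AI tricks": (10.0, 2.7),
    "Combination": (10.0, 10.0),
    "Data parsing and extraction": (9.9, 5.4),
    "Domain specific": (4.0, 4.0),
    "General intelligence": (3.0, 10.0),
    "Instruction following": (3.5, 5.5),
    "Puzzle Solving": (4.0, 10.0),
    "Tool calling": (10.0, 10.0),
}


def tally_leaders(scores: dict[str, tuple[float, float]]) -> dict[str, int]:
    """Count categories led by each model, plus exact ties."""
    counts = {"trinity": 0, "kimi": 0, "tie": 0}
    for trinity_score, kimi_score in scores.values():
        if trinity_score > kimi_score:
            counts["trinity"] += 1
        elif kimi_score > trinity_score:
            counts["kimi"] += 1
        else:
            counts["tie"] += 1
    return counts


leaders = tally_leaders(category_scores)
print(leaders)  # {'trinity': 2, 'kimi': 3, 'tie': 3}
```

By score alone, Trinity Large Preview leads in two categories, Kimi K2.5 in three, and three are tied. The tally ignores consistency, pass rate, and response time, which differ noticeably between the models even in the tied categories.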
