AI BENCHY Compare
Arcee AI: Trinity Large Preview (free) vs MoonshotAI: Kimi K2.5
Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-03-03
| Kipimo | Arcee AI: Trinity Large Preview (free) none Toleo: 2026-01-27 Inapatikana bure | MoonshotAI: Kimi K2.5 none Toleo: 2026-01-27 |
|---|---|---|
| Nafasi | #33 | #35 |
| Wastani wa alama | 4.34 | 4.07 |
| Uthabiti | 9.97 | 8.92 |
| Gharama kwa matokeo | 0.000 | 0.232 |
| Jumla ya gharama | $0.000 | $0.010 |
| Majaribio sahihi | 5/14 | 4/14 |
| Kiwango cha kupita kwa kila jaribio | 35.7% | 35.7% |
| Majaribio yasiyo thabiti | 0 | 2 |
| Tokeni za matokeo | 1,415 | 1,915 |
| Tokeni za hoja | 0 | 0 |
Mgawanyo wa kategoria
| Mbinu za kupinga AI | Alama | Uthabiti | Kiwango cha kupita kwa kila jaribio | Majaribio yasiyo thabiti | Majaribio sahihi | Tokeni za matokeo | Tokeni za hoja |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 1.00 | 10.00 | 0.0% | 0 | 587 | 0 | |
| MoonshotAI: Kimi K2.5 | 2.67 | 7.86 | 11.1% | 1 | 363 | 0 |
| Uchanganuzi na uchimbaji wa data | Alama | Uthabiti | Kiwango cha kupita kwa kila jaribio | Majaribio yasiyo thabiti | Majaribio sahihi | Tokeni za matokeo | Tokeni za hoja |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 9.88 | 10.00 | 100.0% | 0 | 186 | 0 | |
| MoonshotAI: Kimi K2.5 | 5.50 | 5.81 | 83.3% | 1 | 995 | 0 |
| Mahususi kwa domeni | Alama | Uthabiti | Kiwango cha kupita kwa kila jaribio | Majaribio yasiyo thabiti | Majaribio sahihi | Tokeni za matokeo | Tokeni za hoja |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 4.00 | 10.00 | 33.3% | 0 | 21 | 0 | |
| MoonshotAI: Kimi K2.5 | 4.00 | 10.00 | 33.3% | 0 | 29 | 0 |
| Ufuataji wa maagizo | Alama | Uthabiti | Kiwango cha kupita kwa kila jaribio | Majaribio yasiyo thabiti | Majaribio sahihi | Tokeni za matokeo | Tokeni za hoja |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 2.00 | 9.79 | 0.0% | 0 | 63 | 0 | |
| MoonshotAI: Kimi K2.5 | 5.00 | 9.99 | 50.0% | 0 | 61 | 0 |
| Puzzle Solving | Alama | Uthabiti | Kiwango cha kupita kwa kila jaribio | Majaribio yasiyo thabiti | Majaribio sahihi | Tokeni za matokeo | Tokeni za hoja |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 4.00 | 9.99 | 33.3% | 0 | 291 | 0 | |
| MoonshotAI: Kimi K2.5 | 2.00 | 9.92 | 0.0% | 0 | 247 | 0 |
| Mwito wa zana | Alama | Uthabiti | Kiwango cha kupita kwa kila jaribio | Majaribio yasiyo thabiti | Majaribio sahihi | Tokeni za matokeo | Tokeni za hoja |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 10.00 | 10.00 | 100.0% | 0 | 267 | 0 | |
| MoonshotAI: Kimi K2.5 | 10.00 | 10.00 | 100.0% | 0 | 220 | 0 |
Ulinganisho wa haraka
Badilisha jozi ya ulinganisho
Kimi K2.5nonevsGLM 4.7 FlashmediumTrinity Large Preview (free)noneInapatikana burevsGLM 4.7 FlashmediumKimi K2.5nonevsQwen3 Coder NextmediumTrinity Large Preview (free)noneInapatikana burevsQwen3 Coder NextmediumTrinity Large Preview (free)noneInapatikana burevsMiniMax M2.5mediumTrinity Large Preview (free)noneInapatikana burevsgpt-oss-120bmediumInapatikana bureTrinity Large Preview (free)noneInapatikana burevsQwen3.5-FlashmediumTrinity Large Preview (free)noneInapatikana burevsGPT-5 NanomediumMiniMax M2.5mediumvsKimi K2.5noneKimi K2.5nonevsgpt-oss-120bmediumInapatikana bureTrinity Large Preview (free)noneInapatikana burevsQwen3.5-35B-A3BmediumTrinity Large Preview (free)noneInapatikana burevsMiMo-V2-Flashmedium