AI BENCHY Compare
Arcee AI: Trinity Large Preview (free) vs MoonshotAI: Kimi K2.5
Benchmarks gegenereerd uit AI BENCHY-testsuites op: 2026-03-03
| Metriek | Arcee AI: Trinity Large Preview (free) none Releasedatum: 2026-01-27 Gratis beschikbaar | MoonshotAI: Kimi K2.5 none Releasedatum: 2026-01-27 |
|---|---|---|
| Rang | #33 | #35 |
| Gem. score | 4.34 | 4.07 |
| Consistentie | 9.97 | 8.92 |
| Kosten per resultaat | 0.000 | 0.232 |
| Totale kosten | $0.000 | $0.010 |
| Correcte tests | 5/14 | 4/14 |
| Slaagpercentage per poging | 35.7% | 35.7% |
| Instabiele tests | 0 | 2 |
| Uitvoer-tokens | 1,415 | 1,915 |
| Redeneer-tokens | 0 | 0 |
Categorie-uitsplitsing
| Anti-AI-trucs | Score | Consistentie | Slaagpercentage per poging | Instabiele tests | Correcte tests | Uitvoer-tokens | Redeneer-tokens |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 1.00 | 10.00 | 0.0% | 0 | 587 | 0 | |
| MoonshotAI: Kimi K2.5 | 2.67 | 7.86 | 11.1% | 1 | 363 | 0 |
| Gegevensparsering en extractie | Score | Consistentie | Slaagpercentage per poging | Instabiele tests | Correcte tests | Uitvoer-tokens | Redeneer-tokens |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 9.88 | 10.00 | 100.0% | 0 | 186 | 0 | |
| MoonshotAI: Kimi K2.5 | 5.50 | 5.81 | 83.3% | 1 | 995 | 0 |
| Domeinspecifiek | Score | Consistentie | Slaagpercentage per poging | Instabiele tests | Correcte tests | Uitvoer-tokens | Redeneer-tokens |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 4.00 | 10.00 | 33.3% | 0 | 21 | 0 | |
| MoonshotAI: Kimi K2.5 | 4.00 | 10.00 | 33.3% | 0 | 29 | 0 |
| Instructies opvolgen | Score | Consistentie | Slaagpercentage per poging | Instabiele tests | Correcte tests | Uitvoer-tokens | Redeneer-tokens |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 2.00 | 9.79 | 0.0% | 0 | 63 | 0 | |
| MoonshotAI: Kimi K2.5 | 5.00 | 9.99 | 50.0% | 0 | 61 | 0 |
| Puzzle Solving | Score | Consistentie | Slaagpercentage per poging | Instabiele tests | Correcte tests | Uitvoer-tokens | Redeneer-tokens |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 4.00 | 9.99 | 33.3% | 0 | 291 | 0 | |
| MoonshotAI: Kimi K2.5 | 2.00 | 9.92 | 0.0% | 0 | 247 | 0 |
| Toolaanroepen | Score | Consistentie | Slaagpercentage per poging | Instabiele tests | Correcte tests | Uitvoer-tokens | Redeneer-tokens |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 10.00 | 10.00 | 100.0% | 0 | 267 | 0 | |
| MoonshotAI: Kimi K2.5 | 10.00 | 10.00 | 100.0% | 0 | 220 | 0 |
Snelle vergelijking
Vergelijkingspaar wisselen
Kimi K2.5nonevsGLM 4.7 FlashmediumTrinity Large Preview (free)noneGratis beschikbaarvsGLM 4.7 FlashmediumKimi K2.5nonevsQwen3 Coder NextmediumTrinity Large Preview (free)noneGratis beschikbaarvsQwen3 Coder NextmediumTrinity Large Preview (free)noneGratis beschikbaarvsMiniMax M2.5mediumTrinity Large Preview (free)noneGratis beschikbaarvsgpt-oss-120bmediumGratis beschikbaarTrinity Large Preview (free)noneGratis beschikbaarvsQwen3.5-FlashmediumTrinity Large Preview (free)noneGratis beschikbaarvsGPT-5 NanomediumMiniMax M2.5mediumvsKimi K2.5noneKimi K2.5nonevsgpt-oss-120bmediumGratis beschikbaarTrinity Large Preview (free)noneGratis beschikbaarvsQwen3.5-35B-A3BmediumTrinity Large Preview (free)noneGratis beschikbaarvsMiMo-V2-Flashmedium