AI BENCHY Compare
Arcee AI: Trinity Large Preview (free) vs MoonshotAI: Kimi K2.5
Benchmark dihasilkan dari suite pengujian AI BENCHY pada: 2026-03-03
| Metrik | Arcee AI: Trinity Large Preview (free) none Rilis: 2026-01-27 Tersedia gratis | MoonshotAI: Kimi K2.5 none Rilis: 2026-01-27 |
|---|---|---|
| Peringkat | #33 | #35 |
| Skor Rata-rata | 4.34 | 4.07 |
| Konsistensi | 9.97 | 8.92 |
| Biaya per hasil | 0.000 | 0.232 |
| Total Biaya | $0.000 | $0.010 |
| Tes benar | 5/14 | 4/14 |
| Tingkat lulus per percobaan | 35.7% | 35.7% |
| Tes tidak stabil | 0 | 2 |
| Token output | 1,415 | 1,915 |
| Token penalaran | 0 | 0 |
Rincian Kategori
| Trik anti-AI | Skor | Konsistensi | Tingkat lulus per percobaan | Tes tidak stabil | Tes benar | Token output | Token penalaran |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 1.00 | 10.00 | 0.0% | 0 | 587 | 0 | |
| MoonshotAI: Kimi K2.5 | 2.67 | 7.86 | 11.1% | 1 | 363 | 0 |
| Parsing dan ekstraksi data | Skor | Konsistensi | Tingkat lulus per percobaan | Tes tidak stabil | Tes benar | Token output | Token penalaran |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 9.88 | 10.00 | 100.0% | 0 | 186 | 0 | |
| MoonshotAI: Kimi K2.5 | 5.50 | 5.81 | 83.3% | 1 | 995 | 0 |
| Spesifik domain | Skor | Konsistensi | Tingkat lulus per percobaan | Tes tidak stabil | Tes benar | Token output | Token penalaran |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 4.00 | 10.00 | 33.3% | 0 | 21 | 0 | |
| MoonshotAI: Kimi K2.5 | 4.00 | 10.00 | 33.3% | 0 | 29 | 0 |
| Kepatuhan instruksi | Skor | Konsistensi | Tingkat lulus per percobaan | Tes tidak stabil | Tes benar | Token output | Token penalaran |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 2.00 | 9.79 | 0.0% | 0 | 63 | 0 | |
| MoonshotAI: Kimi K2.5 | 5.00 | 9.99 | 50.0% | 0 | 61 | 0 |
| Puzzle Solving | Skor | Konsistensi | Tingkat lulus per percobaan | Tes tidak stabil | Tes benar | Token output | Token penalaran |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 4.00 | 9.99 | 33.3% | 0 | 291 | 0 | |
| MoonshotAI: Kimi K2.5 | 2.00 | 9.92 | 0.0% | 0 | 247 | 0 |
| Pemanggilan alat | Skor | Konsistensi | Tingkat lulus per percobaan | Tes tidak stabil | Tes benar | Token output | Token penalaran |
|---|---|---|---|---|---|---|---|
| Arcee AI: Trinity Large Preview (free) | 10.00 | 10.00 | 100.0% | 0 | 267 | 0 | |
| MoonshotAI: Kimi K2.5 | 10.00 | 10.00 | 100.0% | 0 | 220 | 0 |
Perbandingan Cepat
Ganti Pasangan Perbandingan
Kimi K2.5nonevsGLM 4.7 FlashmediumTrinity Large Preview (free)noneTersedia gratisvsGLM 4.7 FlashmediumKimi K2.5nonevsQwen3 Coder NextmediumTrinity Large Preview (free)noneTersedia gratisvsQwen3 Coder NextmediumTrinity Large Preview (free)noneTersedia gratisvsMiniMax M2.5mediumTrinity Large Preview (free)noneTersedia gratisvsgpt-oss-120bmediumTersedia gratisTrinity Large Preview (free)noneTersedia gratisvsQwen3.5-FlashmediumTrinity Large Preview (free)noneTersedia gratisvsGPT-5 NanomediumMiniMax M2.5mediumvsKimi K2.5noneKimi K2.5nonevsgpt-oss-120bmediumTersedia gratisTrinity Large Preview (free)noneTersedia gratisvsQwen3.5-35B-A3BmediumTrinity Large Preview (free)noneTersedia gratisvsMiMo-V2-Flashmedium