#45
Arcee AI ยท Rilis: 2026-01-27 ยท arcee-ai/trinity-large-preview::none
Tes tidak stabil
1
Tes tidak stabil memiliki hasil campuran antar run (setidaknya satu lulus dan satu gagal).
Jawaban salah: 9 Tidak mengikuti instruksi: 2
Grafik
Pilih model pertama, lalu klik model kedua untuk membuka halaman berdampingan.
Perbandingan Cepat
Trinity Large PreviewnoneTersedia gratisvsGPT-5.4noneTrinity Large PreviewnoneTersedia gratisvsKimi K2.5noneTrinity Large PreviewnoneTersedia gratisvsMiniMax M2.5mediumTrinity Large PreviewnoneTersedia gratisvsGPT-4o-mininoneTrinity Large PreviewnoneTersedia gratisvsQwen3.5-35B-A3BnoneTrinity Large PreviewnoneTersedia gratisvsQwen3 Coder NextnoneTrinity Large PreviewnoneTersedia gratisvsGemini 3 Flash PreviewmediumTrinity Large PreviewnoneTersedia gratisvsGemini 3.1 Pro PreviewmediumTrinity Large PreviewnoneTersedia gratisvsStep 3.5 FlashmediumTersedia gratis
Rincian Kategori
| Kategori | Skor Rata-rata | Konsistensi | Tes benar |
|---|---|---|---|
| Anti-AI Tricks | 10.0 | 10.0 | |
| Combined | 10.0 | 10.0 | |
| Data parsing and extraction | 9.9 | 10.0 | |
| Domain specific | 4.0 | 10.0 | |
| General Intelligence | 3.0 | 9.9 | |
| Instructions following | 3.5 | 6.7 | |
| Puzzle Solving | 4.0 | 10.0 | |
| Tool Calling | 10.0 | 10.0 |