#13
Stepfun · Released: 2026-02-01 · stepfun/step-3.5-flash::medium
Unstable tests
2
An unstable test is one with mixed results across runs (at least one pass and at least one failure).
Did not follow instructions: 3 · Wrong answers: 3
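
The definition of an unstable test above can be made concrete with a small sketch. It assumes each test's runs are recorded as a list of booleans (True for pass, False for fail). The function name `is_unstable`, the test names, and the sample data are hypothetical and do not come from the benchmark itself.

```python
def is_unstable(run_results: list[bool]) -> bool:
    """Return True when the runs include at least one pass and at least one failure."""
    return any(run_results) and not all(run_results)


# Hypothetical per-test run outcomes (True = pass, False = fail).
runs_by_test = {
    "test_a": [True, True, True],     # always passes: stable
    "test_b": [True, False, True],    # mixed results: unstable
    "test_c": [False, False, False],  # always fails: stable
}

# Collect the names of tests whose runs disagree with each other.
unstable = [name for name, runs in runs_by_test.items() if is_unstable(runs)]
```

Under this reading, a test that fails on every run counts as stable: it lowers the score but is not reported as unstable.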
Category Breakdown
| Category | Average Score | Consistency | Correct tests |
|---|---|---|---|
| Anti-AI Tricks | 10.0 | 10.0 | |
| Combined | 10.0 | 10.0 | |
| Data parsing and extraction | 10.0 | 10.0 | |
| Domain specific | 4.0 | 7.2 | |
| General Intelligence | 6.0 | 10.0 | |
| Instructions following | 9.0 | 6.8 | |
| Puzzle Solving | 4.0 | 10.0 | |
| Tool Calling | 10.0 | 10.0 | |