#45
Arcee AI · 发布日期: 2026-01-27 · arcee-ai/trinity-large-preview::none
不稳定测试
1
不稳定测试在运行之间出现混合结果(至少一次通过且至少一次失败)。
答案错误: 9 未遵循指令: 2
图表
先选择第一个模型,再点击第二个模型打开并排页面。
快速对比
Trinity Large Previewnone免费可用vsGPT-5.4noneTrinity Large Previewnone免费可用vsKimi K2.5noneTrinity Large Previewnone免费可用vsMiniMax M2.5mediumTrinity Large Previewnone免费可用vsGPT-4o-mininoneTrinity Large Previewnone免费可用vsQwen3.5-35B-A3BnoneTrinity Large Previewnone免费可用vsQwen3 Coder NextnoneTrinity Large Previewnone免费可用vsGemini 3 Flash PreviewmediumTrinity Large Previewnone免费可用vsGemini 3.1 Pro PreviewmediumTrinity Large Previewnone免费可用vsStep 3.5 Flashmedium免费可用
类别细分
| 类别 | 平均分 | 一致性 | 测试正确 |
|---|---|---|---|
| Anti-AI Tricks | 10.0 | 10.0 | |
| Combined | 10.0 | 10.0 | |
| Data parsing and extraction | 9.9 | 10.0 | |
| Domain specific | 4.0 | 10.0 | |
| General Intelligence | 3.0 | 9.9 | |
| Instructions following | 3.5 | 6.7 | |
| Puzzle Solving | 4.0 | 10.0 | |
| Tool Calling | 10.0 | 10.0 |