AI BENCHY Compare
MoonshotAI: Kimi K2.5 vs Owl Alpha
Last updated at: 2026-05-22
| Metric | Kimi K2.5 Kimi K2.5 none | Owl Alpha Owl Alpha none |
|---|---|---|
| Score | 5.3 | 5.7 |
| Rank | #126 | #106 |
| Reliability | 10.0 | 10.0 |
| Consistency | 8.9 | 9.2 |
| Tests Correct | ||
| Attempt pass rate | 36.7% | 41.7% |
| Flaky tests | 3 | 2 |
| Total Runs | 60 | 60 |
| Cost per result | 0.428 | 0.000 |
| Total Cost | $0.026 | $0.000 |
| Input Price | $0.400 / 1M | $0.000 / 1M |
| Output Price | $1.900 / 1M | $0.000 / 1M |
| Output Tokens | 6,734 | 4,864 |
| Reasoning Tokens | 0 | 0 |
| Response Time (avg) | 14.16s | 8.84s |
| Response Time (max) | 42.13s | 47.10s |
| Response Time (total) | 184.10s | 176.83s |
Score vs Total Cost
Response Time (avg)
Score vs Response Time (avg)
Total Output Tokens
Score vs Total Output Tokens
Category Breakdown
Quick Compare
Switch Comparison Pair
CobuddymediumFree AvailablevsOwl AlphanoneKimi K2.5nonevsElephant AlphamediumMistral Small 4mediumvsKimi K2.5noneMiniMax M2.5mediumFree AvailablevsKimi K2.5nonegpt-oss-120bmediumFree AvailablevsOwl AlphanoneNemotron 3 SupermediumFree AvailablevsOwl AlphanoneMiniMax M2.7mediumvsKimi K2.5noneKimi K2.5nonevsgpt-oss-120bmediumFree AvailableMiniMax M2.5mediumFree AvailablevsOwl AlphanoneMistral Small 4mediumvsOwl AlphanoneGPT-5 NanomediumvsOwl AlphanoneCobuddymediumFree AvailablevsKimi K2.5none