AI BENCHY Compare
MoonshotAI: Kimi K2.5 vs Owl Alpha
Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-05-22
| Kipimo | Kimi K2.5 Kimi K2.5 none | Owl Alpha Owl Alpha none |
|---|---|---|
| Alama | 5.3 | 5.7 |
| Nafasi | #126 | #106 |
| Uaminifu | 10.0 | 10.0 |
| Uthabiti | 8.9 | 9.2 |
| Majaribio sahihi | ||
| Kiwango cha kupita kwa kila jaribio | 36.7% | 41.7% |
| Majaribio yasiyo thabiti | 3 | 2 |
| Jumla ya uendeshaji | 60 | 60 |
| Gharama kwa matokeo | 0.428 | 0.000 |
| Jumla ya gharama | $0.026 | $0.000 |
| Bei ya ingizo | $0.400 / 1M | $0.000 / 1M |
| Bei ya toleo | $1.900 / 1M | $0.000 / 1M |
| Tokeni za matokeo | 6,734 | 4,864 |
| Tokeni za hoja | 0 | 0 |
| Muda wa majibu (wastani) | 14.16s | 8.84s |
| Muda wa majibu (upeo) | 42.13s | 47.10s |
| Muda wa majibu (jumla) | 184.10s | 176.83s |
Alama dhidi ya gharama ya jumla
Muda wa majibu (wastani)
Alama vs Muda wa majibu (wastani)
Jumla ya tokeni za matokeo
Alama vs Jumla ya tokeni za matokeo
Mgawanyo wa kategoria
Ulinganisho wa haraka
Badilisha jozi ya ulinganisho
CobuddymediumInapatikana burevsOwl AlphanoneKimi K2.5nonevsElephant AlphamediumMistral Small 4mediumvsKimi K2.5noneMiniMax M2.5mediumInapatikana burevsKimi K2.5nonegpt-oss-120bmediumInapatikana burevsOwl AlphanoneNemotron 3 SupermediumInapatikana burevsOwl AlphanoneMiniMax M2.7mediumvsKimi K2.5noneKimi K2.5nonevsgpt-oss-120bmediumInapatikana bureMiniMax M2.5mediumInapatikana burevsOwl AlphanoneMistral Small 4mediumvsOwl AlphanoneGPT-5 NanomediumvsOwl AlphanoneCobuddymediumInapatikana burevsKimi K2.5none