AI BENCHY Compare

MoonshotAI: Kimi K2.6 vs Owl Alpha

Benchmarks generated from AI BENCHY test suites on: 2026-04-30

Metric	Kimi K2.6 (none; release date: 2026-04-20)	Owl Alpha (medium; release date: 2026-04-30)
Score	5.8	5.8
Rank	#92	#91
Reliability	n/a	10.0
Consistency	9.1	9.5
Correct tests
Pass rate per attempt	42.6%	40.7%
Flaky tests	2	1
Total runs	54	54
Cost per result	0.543	0.000
Total cost	$0.038	$0.000
Input price	$0.740 / 1M	$0.000 / 1M
Output price	$3.490 / 1M	$0.000 / 1M
Output tokens	2,973	1,596
Reasoning tokens	0	0
Response time (avg)	2.05s	11.04s
Response time (max)	6.65s	58.63s
Response time (total)	36.93s	198.65s
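
The rounded figures in the summary table can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions the page does not state: that the pass rate per attempt is passing runs divided by total runs, and that the average response time is total time divided by the number of timed items. The names `reported` and `implied_counts` are hypothetical.

```python
# Figures copied from the summary table above.
reported = {
    "Kimi K2.6": {"pass_rate": 0.426, "runs": 54, "avg_s": 2.05, "total_s": 36.93},
    "Owl Alpha": {"pass_rate": 0.407, "runs": 54, "avg_s": 11.04, "total_s": 198.65},
}


def implied_counts(row):
    """Back out integer counts from the rounded figures (assumed definitions)."""
    # Assumption: pass rate per attempt = passing runs / total runs.
    passes = round(row["pass_rate"] * row["runs"])
    # Assumption: average response time = total time / number of timed items.
    timed_items = round(row["total_s"] / row["avg_s"])
    return passes, timed_items


for model, row in reported.items():
    passes, timed_items = implied_counts(row)
    print(f"{model}: ~{passes} passing runs of {row['runs']}, ~{timed_items} timed items")
```

Under these assumptions both models come out at about 18 timed items, which would be consistent with 54 runs at three attempts per test. The page itself does not confirm that breakdown.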

[Charts: Top models by score; Score vs total cost; Response time (avg); Score vs response time (avg); Total output tokens; Score vs total output tokens]

Category breakdown

Anti-AI tricks	Score	Consistency	Pass rate per attempt	Flaky tests	Correct tests	Response time (avg)	Output tokens	Reasoning tokens
Kimi K2.6	4.6	10.0	25.0%	0		1.39s	471	0
Owl Alpha	4.8	10.0	25.0%	0		3.97s	87	0

Programming	Score	Consistency	Pass rate per attempt	Flaky tests	Correct tests	Response time (avg)	Output tokens	Reasoning tokens
Kimi K2.6	10.0	10.0	100.0%	0		6.65s	1,176	0
Owl Alpha	10.0	10.0	100.0%	0		7.35s	402	0

Combined	Score	Consistency	Pass rate per attempt	Flaky tests	Correct tests	Response time (avg)	Output tokens	Reasoning tokens
Kimi K2.6	3.0	10.0	0.0%	0		3.38s	290	0
Owl Alpha	3.0	10.0	0.0%	0		10.01s	315	0

Data parsing and extraction	Score	Consistency	Pass rate per attempt	Flaky tests	Correct tests	Response time (avg)	Output tokens	Reasoning tokens
Kimi K2.6	10.0	10.0	100.0%	0		1.32s	201	0
Owl Alpha	10.0	10.0	100.0%	0		21.64s	246	0

Domain-specific	Score	Consistency	Pass rate per attempt	Flaky tests	Correct tests	Response time (avg)	Output tokens	Reasoning tokens
Kimi K2.6	5.3	7.2	44.4%	1		1.48s	42	0
Owl Alpha	5.3	10.0	33.3%	0		8.58s	28	0

General intelligence	Score	Consistency	Pass rate per attempt	Flaky tests	Correct tests	Response time (avg)	Output tokens	Reasoning tokens
Kimi K2.6	5.4	3.5	33.3%	1		1.55s	138	0
Owl Alpha	4.3	10.0	0.0%	0		58.63s	98	0

Instruction following	Score	Consistency	Pass rate per attempt	Flaky tests	Correct tests	Response time (avg)	Output tokens	Reasoning tokens
Kimi K2.6	6.5	10.0	50.0%	0		1.64s	72	0
Owl Alpha	6.3	10.0	50.0%	0		9.59s	57	0

Puzzle solving	Score	Consistency	Pass rate per attempt	Flaky tests	Correct tests	Response time (avg)	Output tokens	Reasoning tokens
Kimi K2.6	3.4	9.7	0.0%	0		1.66s	343	0
Owl Alpha	3.4	7.2	11.1%	1		3.44s	135	0

Tool calling	Score	Consistency	Pass rate per attempt	Flaky tests	Correct tests	Response time (avg)	Output tokens	Reasoning tokens
Kimi K2.6	10.0	10.0	100.0%	0		4.46s	240	0
Owl Alpha	10.0	10.0	100.0%	0		8.26s	228	0
