#37

Qwen3.6 27B

Qwen Release: 2026-04-20 Tested on: 2026-04-27 21:31 qwen/qwen3.6-27b::medium

(medium) (none)

7.9

Consistency

8.5

10.0

$0.043

Total Output Tokens

21,553

Input Price

$0.500 / 1M

Output Price

$2.000 / 1M

Wrong Tests: 2

Attempt pass rate: 77.8%

Flaky tests

Flaky tests had mixed outcomes across runs (at least one pass and one fail).

25.56s

Response Time (max): 47.48s

Response Time (total): 153.33s

No answer: 1 Wrong answer: 1

Run history

Tested on	Score	Reliability	Tests Correct	Total Cost	Compare
2026-04-27 21:48 New test added	7.0	10.0		$0.209	Compare
2026-04-27 21:31 First recorded run	7.9	10.0		$0.043	Current run

Run comparison

Run	Score	Consistency	Reliability	Tests Correct	Flaky tests	Total Output Tokens	Total Cost	Response Time (avg)
2026-04-27 21:31 · First recorded run	7.9	8.5	10.0	4/6	1	21,553	$0.043	25.56s
2026-04-27 21:48 · New test added	7.0	7.9	10.0	9/18	5	99,362	$0.209	50.53s
Difference	+0.9	+0.6	0.0	-5	-4	-77809	-$0.166	-24972ms