#37

Qwen3.6 27B

Qwen · Release: 2026-04-20 · Tested on: 2026-04-27 21:31 · qwen/qwen3.6-27b::medium

(medium) (none)

Summary

Qwen3.6 27B scores 7.9 on AI BENCHY and ranks #37. It has 10.0 reliability, a 77.8% pass rate, $0.043 total cost, and 25.56s average response time.

What makes Qwen3.6 27B unique: Its total benchmark cost is unusually low for its score range.

Score

7.9

Consistency

8.5

Reliability

10.0

Total Cost (Current Price)

$0.043

Total Output Tokens

21,553

Total Input Tokens

Input Price

$0.500 / 1M

Output Price

$2.000 / 1M
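As a rough check on how the total cost relates to the token counts and per-million prices listed above, the arithmetic can be sketched as follows. The function name `estimate_cost` is hypothetical, and because the input token count is not shown on this page, the example uses 0 as a placeholder and computes only the output portion.

```python
def estimate_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the dollar cost of a run given prices per one million tokens."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# Output tokens and prices are the values listed on this page.
# The input token count is not listed, so 0 is a placeholder assumption.
output_only = estimate_cost(0, 21_553, 0.500, 2.000)
print(f"${output_only:.3f}")  # prints $0.043
```

The output portion alone already rounds to the listed $0.043, which suggests that input tokens contribute very little to the total for this run.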

Tests Correct

4/6

Wrong Tests: 2

Attempt pass rate: 77.8%

Flaky tests

1

Flaky tests had mixed outcomes across runs (at least one pass and one fail).
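The flaky-test definition and the attempt pass rate can be illustrated with a minimal sketch. The per-attempt outcomes below are hypothetical and are not the actual results of this run; the helper names `is_flaky` and `attempt_pass_rate` are also assumptions, since the page does not describe how the benchmark computes these values internally.

```python
def is_flaky(outcomes):
    """A test is flaky if its attempts include at least one pass and one fail."""
    return any(outcomes) and not all(outcomes)


def attempt_pass_rate(runs):
    """Percentage of passing attempts across all tests."""
    attempts = [ok for outcomes in runs.values() for ok in outcomes]
    return 100 * sum(attempts) / len(attempts)


# Hypothetical outcomes (True = pass), for illustration only.
runs = {
    "test_a": [True, True, True],     # always passes
    "test_b": [True, False, True],    # mixed outcomes, so flaky
    "test_c": [False, False, False],  # always fails, so not flaky
}

flaky = [name for name, outcomes in runs.items() if is_flaky(outcomes)]
print(flaky)
print(f"{attempt_pass_rate(runs):.1f}%")
```

Under this reading, the attempt pass rate counts individual attempts rather than tests, which is why it can differ from the share of tests marked correct.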

Response Time (avg)

25.56s

Response Time (max): 47.48s

Response Time (total): 153.33s
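The average and total response times can be cross-checked against each other. The sketch below assumes the average is simply the total divided by the number of timed responses; the page does not state this, and `implied_response_count` is a hypothetical helper.

```python
def implied_response_count(total_seconds, average_seconds):
    """Number of responses implied by total = average * count."""
    return total_seconds / average_seconds


count = implied_response_count(153.33, 25.56)
print(round(count))  # prints 6
```

The result is consistent with the six tests reported for this run in the comparison table.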

No answer: 1 · Wrong answer: 1

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#37 Qwen3.6 27B

medium

Cost: $0.009
Time: 39.6s
Tokens: 3,090 tok

Run history

Tested on	Score	Reliability	Total Cost
2026-07-16 22:13 New test added	6.5	10.0	$0.779 ↑
2026-06-04 13:21 New test added	6.8	10.0	$0.444 ↑
2026-05-21 23:59 Suite changed	6.6	9.9	$0.272
2026-04-27 21:48 New test added	7.0	10.0	$0.209
2026-04-27 21:31 First recorded run (current run)	7.9	10.0	$0.043

Run comparison

Run	Score	Consistency	Reliability	Tests Correct	Flaky tests	Total Output Tokens	Total Cost	Response Time (avg)
2026-04-27 21:31 · First recorded run	7.9	8.5	10.0	4/6	1	21,553	$0.043	25.56s
2026-05-21 23:59 · Suite changed	6.6	8.1	9.9	9/20	5	118,704	$0.272	57.65s
Difference	+1.3	+0.4	+0.1	-5	-4	-97,151	-$0.229	-32.096s
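The Difference row appears to be the first run minus the later run for each column. A minimal sketch that reproduces it from the two rows above, with hypothetical key names and the hypothetical helper `run_difference`:

```python
# Values copied from the two runs in the comparison table.
first_run = {
    "score": 7.9, "consistency": 8.5, "reliability": 10.0,
    "tests_correct": 4, "flaky_tests": 1, "output_tokens": 21_553,
    "total_cost": 0.043, "avg_response_s": 25.56,
}
later_run = {
    "score": 6.6, "consistency": 8.1, "reliability": 9.9,
    "tests_correct": 9, "flaky_tests": 5, "output_tokens": 118_704,
    "total_cost": 0.272, "avg_response_s": 57.65,
}


def run_difference(a, b, ndigits=3):
    """Per-metric difference a - b, rounded to hide floating-point noise."""
    return {key: round(a[key] - b[key], ndigits) for key in a}


diff = run_difference(first_run, later_run)
for key, value in diff.items():
    print(f"{key}: {value:+}")
```

Two caveats apply when reading the row. The time difference computed from the rounded averages is -32.09s, slightly off from the listed value, presumably because the page subtracts unrounded timings. The tests-correct difference compares raw counts from suites of different sizes (4/6 versus 9/20), so it is not a like-for-like measure.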