Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Qwen: Qwen3.5 Plus 2026-02-15 vs Qwen: Qwen3.6 Plus

Summary

Qwen3.5 Plus 2026-02-15 vs Qwen3.6 Plus benchmark comparison: Qwen3.5 Plus 2026-02-15 leads on average score with 8.0 vs 7.8. Qwen3.6 Plus has the lower benchmark cost at $0.294 vs $0.310. Qwen3.6 Plus is faster at 30.70s vs 73.79s, with pass rates of 73.0% vs 69.8%.

Recommended model: Qwen3.6 Plus - Its score stays close to the best score here (7.8 vs 8.0), while responding about 2.4x faster than Qwen3.5 Plus 2026-02-15.

Last updated at: 2026-07-02

Metric Qwen3.5 Plus 2026-02-15 Qwen3.5 Plus 2026-02-15 medium Release: 2026-02-15 Qwen3.6 Plus Qwen3.6 Plus medium Release: 2026-04-20
Score 8.0 7.8
Rank #28 #31
Reliability 10.0 10.0
Consistency 8.8 9.3
Tests Correct
Attempt pass rate 73.0% 69.8%
Flaky tests 3 2
Total Runs 63 63
Cost per result 2.445 0.831
Total Cost $0.310 $0.294
Input Price $0.260 / 1M $0.325 / 1M
Output Price $1.560 / 1M $1.950 / 1M
Total Input Tokens 40,918 41,565
Output Tokens 2,159 1,853
Reasoning Tokens 189,604 141,973
Response Time (avg) 73.79s 30.70s
Response Time (max) 266.69s 201.68s
Response Time (total) 1033.07s 613.99s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#28 Qwen3.5 Plus 2026-02-15

medium
Cost
$0.011
Time
125.5s
Tokens
7,040 tok

#31 Qwen3.6 Plus

medium
Cost
$0.024
Time
219.0s
Tokens
12,235 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Qwen3.5 Plus 2026-02-15 8.2 7.9 83.3% 1 45.78s 672 205 21,236
Qwen3.6 Plus 10.0 10.0 100.0% 0 9.90s 672 207 7,557
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Qwen3.5 Plus 2026-02-15 6.6 7.1 44.4% 1 180.70s 6,950 420 80,595
Qwen3.6 Plus 6.1 7.8 44.4% 1 153.12s 7,098 58 50,586
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Qwen3.5 Plus 2026-02-15 10.0 10.0 100.0% 0 46.85s 14,934 421 7,906
Qwen3.6 Plus 10.0 10.0 100.0% 0 34.95s 14,934 452 13,073
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Qwen3.5 Plus 2026-02-15 10.0 10.0 100.0% 0 46.91s 7,782 270 14,916
Qwen3.6 Plus 10.0 10.0 100.0% 0 14.95s 7,782 270 10,706
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Qwen3.5 Plus 2026-02-15 5.3 10.0 33.3% 0 17.50s 444 35 16,680
Qwen3.6 Plus 2.9 7.2 11.1% 1 29.59s 771 56 33,464
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Qwen3.5 Plus 2026-02-15 4.7 1.6 66.7% 1 79.86s 344 73 8,675
Qwen3.6 Plus 5.1 10.0 0.0% 0 27.05s 516 111 5,232
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Qwen3.5 Plus 2026-02-15 10.0 10.0 100.0% 0 31.93s 699 101 7,704
Qwen3.6 Plus 10.0 10.0 100.0% 0 7.54s 699 102 5,552
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Qwen3.5 Plus 2026-02-15 10.0 10.0 100.0% 0 32.50s 696 301 13,853
Qwen3.6 Plus 10.0 10.0 100.0% 0 6.34s 696 309 6,712
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Qwen3.5 Plus 2026-02-15 10.0 10.0 100.0% 0 7.54s 8,193 309 909
Qwen3.6 Plus 10.0 10.0 100.0% 0 5.87s 8,193 267 1,330
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Qwen3.5 Plus 2026-02-15 3.0 10.0 0.0% 0 103.81s 204 24 17,130
Qwen3.6 Plus 3.0 10.0 0.0% 0 47.51s 204 21 7,761

Quick Compare

Switch Comparison Pair