#31 Qwen3.6 Plus
medium- Cost
- $0.024
- Time
- 219.0s
- Tokens
- 12,235 tok
Summary
Qwen3.6 Plus scores 7.8 on AI BENCHY and ranks #31. It has 10.0 reliability, a 69.8% pass rate, $0.294 total cost, and 30.70s average response time.
Identity note
Qwen3.6 Plus Preview was the preview version of Qwen3.6 Plus.
7.8
Consistency
9.3
10.0
Total Output Tokens
143,826
Total Input Tokens
41,565
Input Price
$0.325 / 1M
Output Price
$1.950 / 1M
Flaky tests
2
Flaky tests had mixed outcomes across runs (at least one pass and one fail).
Generation showcase
Prompt: Create a detailed SVG illustration of a hamster playing table tennis.
Run history
| Tested on | Score | Reliability | Tests Correct | Total Cost | Compare |
|---|---|---|---|---|---|
| 2026-06-04 13:30 New test added | 7.9 | 10.0 | $0.294 ↑ | Current run | |
| 2026-05-22 00:01 Re-test | 7.8 | 10.0 | $0.082 | Compare | |
| 2026-04-11 01:44 First recorded run | 8.1 | N/A | $0.000 | Compare |
This run used a different benchmark suite. Keep suite changes in mind when reading historical movement.
Run comparison
| Run | Score | Consistency | Reliability | Tests Correct | Flaky tests | Total Output Tokens | Total Input Tokens | Total Cost | Response Time (avg) |
|---|---|---|---|---|---|---|---|---|---|
| 2026-06-04 13:30 · Current run | 7.8 | 9.3 | 10.0 | 14/21 | 2 | 143,826 | 41,565 | $0.294 | 30.70s |
| 2026-04-11 01:44 · First recorded run | 8.1 | 9.5 | N/A | 13/18 | 1 | 85,545 | 0 | $0.000 | 15.27s |
| Difference | -0.3 | -0.2 | +1 | +1 | +58281 | +41565 | +$0.294 | +15431ms |
These two runs used different benchmark suites, so the deltas reflect both model changes and suite changes.
Price History
Historical pricing data for this model from OpenRouter.
| Date | Input Price | Output Price |
|---|---|---|
| 2026-06-04 15:40 | $0.325 / 1M | $1.950 / 1M |
Choose the first model, then click a second model to open a side-by-side page.
| Category | Score | Consistency | Tests Correct |
|---|---|---|---|
| Anti-AI Tricks | 10.0 | 10.0 | |
| Coding | 6.1 | 7.8 | |
| Combined | 10.0 | 10.0 | |
| Data parsing and extraction | 10.0 | 10.0 | |
| Domain specific | 2.9 | 7.2 | |
| General Intelligence | 5.1 | 10.0 | |
| Instructions following | 10.0 | 10.0 | |
| Puzzle Solving | 10.0 | 10.0 | |
| Tool Calling | 10.0 | 10.0 | |
| Trivia | 3.0 | 10.0 |