Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

Google: Gemma 4 26B A4B vs Qwen: Qwen3.6 Max Preview

Summary

Gemma 4 26B A4B vs Qwen3.6 Max Preview benchmark comparison: Qwen3.6 Max Preview leads on average score with 6.0 vs 5.5. Gemma 4 26B A4B has the lower benchmark cost at $0.004 vs $0.075. Qwen3.6 Max Preview is faster at 3.30s vs 5.91s, with pass rates of 44.4% vs 58.7%.

Recommended model: Qwen3.6 Max Preview - It has the best score here (6.0), while responding about 1.8x faster than Gemma 4 26B A4B.

Last updated at: 2026-07-02

Metric Gemma 4 26B A4B Gemma 4 26B A4B none Release: 2026-04-03 Free Available Qwen3.6 Max Preview Qwen3.6 Max Preview none Release: 2026-04-20
Score 5.5 6.0
Rank #125 #103
Reliability 10.0 10.0
Consistency 9.2 9.2
Tests Correct
Attempt pass rate 44.4% 58.7%
Flaky tests 2 2
Total Runs 63 63
Cost per result 0.068 0.824
Total Cost $0.004 $0.075
Input Price $0.060 / 1M $1.040 / 1M
Output Price $0.330 / 1M $6.240 / 1M
Total Input Tokens 40,038 42,509
Output Tokens 1,824 4,779
Reasoning Tokens 0 0
Response Time (avg) 5.91s 3.30s
Response Time (max) 57.10s 20.51s
Response Time (total) 124.05s 69.40s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#125 Gemma 4 26B A4B

none
Cost
$0.001
Time
39.5s
Tokens
790 tok

#103 Qwen3.6 Max Preview

none
Cost
$0.025
Time
83.9s
Tokens
4,066 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemma 4 26B A4B 8.3 10.0 75.0% 0 1.28s 852 230 0
Qwen3.6 Max Preview 5.2 7.9 41.7% 1 2.63s 696 513 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemma 4 26B A4B 3.7 7.2 22.2% 1 4.16s 7,736 476 0
Qwen3.6 Max Preview 3.8 7.3 22.2% 1 3.12s 7,913 456 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemma 4 26B A4B 3.0 10.0 0.0% 0 30.53s 13,650 309 0
Qwen3.6 Max Preview 3.0 10.0 0.0% 0 20.51s 14,949 2,842 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 1.70s 8,352 285 0
Qwen3.6 Max Preview 10.0 10.0 100.0% 0 2.87s 7,794 243 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemma 4 26B A4B 3.6 7.2 22.2% 1 2.49s 903 27 0
Qwen3.6 Max Preview 7.7 10.0 66.7% 0 1.22s 789 18 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemma 4 26B A4B 4.0 10.0 0.0% 0 3.54s 576 85 0
Qwen3.6 Max Preview 4.3 10.0 0.0% 0 1.62s 522 76 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemma 4 26B A4B 6.3 10.0 50.0% 0 690ms 795 75 0
Qwen3.6 Max Preview 9.8 10.0 100.0% 0 1.40s 711 69 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemma 4 26B A4B 6.2 10.0 33.3% 0 744ms 828 114 0
Qwen3.6 Max Preview 10.0 10.0 100.0% 0 2.65s 714 321 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemma 4 26B A4B 10.0 10.0 100.0% 0 57.10s 6,123 210 0
Qwen3.6 Max Preview 10.0 10.0 100.0% 0 5.27s 8,211 222 0
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Gemma 4 26B A4B 3.0 10.0 0.0% 0 778ms 223 13 0
Qwen3.6 Max Preview 3.0 10.0 0.0% 0 1.97s 210 19 0

Quick Compare

Switch Comparison Pair