Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

Google: Gemini 2.5 Flash vs Qwen: Qwen3.6 35B A3B

Last updated at: 2026-06-01

Metric Gemini 2.5 Flash Gemini 2.5 Flash medium Release: 2025-06-17 Qwen3.6 35B A3B Qwen3.6 35B A3B medium Release: 2026-04-20
Score 7.7 7.8
Rank #40 #34
Reliability 10.0 10.0
Consistency 9.6 9.5
Tests Correct
Attempt pass rate 68.3% 68.5%
Flaky tests 1 1
Total Runs 60 60
Cost per result 2.750 1.048
Total Cost $0.358 $0.130
Input Price $0.300 / 1M $0.140 / 1M
Output Price $2.500 / 1M $1.000 / 1M
Output Tokens 1,924 18,304
Reasoning Tokens 137,255 115,531
Response Time (avg) 15.57s 17.26s
Response Time (max) 95.48s 86.11s
Response Time (total) 311.47s 310.65s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 2.5 Flash 8.4 10.0 75.0% 0 6.30s 255 10,233
Qwen3.6 35B A3B 10.0 10.0 100.0% 0 6.02s 1,154 12,385
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 2.5 Flash 6.6 10.0 50.0% 0 54.56s 537 24,413
Qwen3.6 35B A3B 6.6 10.0 50.0% 0 59.35s 6,601 22,535
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 2.5 Flash 10.0 10.0 100.0% 0 28.44s 303 11,922
Qwen3.6 35B A3B 0.0 0.0 0.0% 0 0ms 0 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 2.5 Flash 10.0 10.0 100.0% 0 4.06s 279 2,325
Qwen3.6 35B A3B 10.0 10.0 100.0% 0 12.99s 2,591 9,968
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 2.5 Flash 5.9 7.2 55.6% 1 37.34s 18 80,702
Qwen3.6 35B A3B 5.3 7.2 44.4% 1 22.50s 6,193 39,116
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 2.5 Flash 4.8 10.0 0.0% 0 4.86s 92 1,899
Qwen3.6 35B A3B 4.4 9.9 0.0% 0 8.66s 129 4,569
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 2.5 Flash 9.8 10.0 100.0% 0 2.62s 69 1,203
Qwen3.6 35B A3B 10.0 10.0 100.0% 0 7.50s 219 7,404
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 2.5 Flash 7.7 10.0 66.7% 0 3.18s 126 2,499
Qwen3.6 35B A3B 8.0 10.0 66.7% 0 5.95s 655 9,228
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 2.5 Flash 10.0 10.0 100.0% 0 6.20s 234 1,140
Qwen3.6 35B A3B 0.0 0.0 0.0% 0 0ms 0 0
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 2.5 Flash 3.0 10.0 0.0% 0 2.76s 11 919
Qwen3.6 35B A3B 3.0 10.0 0.0% 0 32.90s 762 10,326

Quick Compare

Switch Comparison Pair