
AI BENCHY Compare

Google: Gemini 3 Flash Preview vs Qwen: Qwen3.5-27B

Last updated at: 2026-05-29

| Metric | Gemini 3 Flash Preview (none) | Qwen3.5-27B (medium) |
| --- | --- | --- |
| Release | 2025-12-17 | 2026-02-24 |
| Score | 7.7 | 7.9 |
| Rank | #44 | #28 |
| Reliability | 10.0 | 10.0 |
| Consistency | 9.2 | 8.9 |
| Tests Correct | | |
| Attempt pass rate | 70.0% | 73.3% |
| Flaky tests | 2 | 3 |
| Total Runs | 60 | 60 |
| Cost per result | 0.175 | 4.532 |
| Total Cost | $0.023 | $0.590 |
| Input Price | $0.500 / 1M | $0.195 / 1M |
| Output Price | $3.000 / 1M | $1.560 / 1M |
| Output Tokens | 1,879 | 2,569 |
| Reasoning Tokens | 0 | 304,894 |
| Response Time (avg) | 1.70s | 60.09s |
| Response Time (max) | 3.56s | 177.36s |
| Response Time (total) | 22.05s | 1201.89s |
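The headline trade-off in the summary table is easier to see as ratios. The following is a minimal sketch, not part of the benchmark itself: the `summary` dictionary and the `ratio` helper are hypothetical names introduced here, and the numbers are copied by hand from the table above.

```python
# Headline figures copied from the summary table above.
# The dictionary layout and key names are illustrative assumptions.
summary = {
    "Gemini 3 Flash Preview": {"score": 7.7, "total_cost_usd": 0.023, "avg_response_s": 1.70},
    "Qwen3.5-27B": {"score": 7.9, "total_cost_usd": 0.590, "avg_response_s": 60.09},
}


def ratio(metric, numerator="Qwen3.5-27B", denominator="Gemini 3 Flash Preview"):
    """Return how many times larger `metric` is for `numerator` than for `denominator`."""
    return summary[numerator][metric] / summary[denominator][metric]


cost_ratio = ratio("total_cost_usd")
latency_ratio = ratio("avg_response_s")
score_gap = summary["Qwen3.5-27B"]["score"] - summary["Gemini 3 Flash Preview"]["score"]

print(f"Total cost ratio:  {cost_ratio:.1f}x")     # 25.7x
print(f"Avg latency ratio: {latency_ratio:.1f}x")  # 35.3x
print(f"Score gap:         {score_gap:+.1f}")      # +0.2
```

On these figures, Qwen3.5-27B scores 0.2 points higher while costing roughly 26 times as much in total and taking roughly 35 times as long per response on average.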

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

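| Anti-AI Tricks | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3 Flash Preview | 8.3 | 10.0 | 75.0% | 0 | | 1.25s | 214 | 0 |
| Qwen3.5-27B | 8.7 | 7.9 | 91.7% | 1 | | 19.75s | 569 | 31,505 |

| Coding | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3 Flash Preview | 6.8 | 10.0 | 50.0% | 0 | | 2.19s | 447 | 0 |
| Qwen3.5-27B | 7.0 | 9.8 | 50.0% | 0 | | 123.86s | 416 | 64,993 |

| Combined | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3 Flash Preview | 4.7 | 1.6 | 66.7% | 1 | | 3.56s | 350 | 0 |
| Qwen3.5-27B | 10.0 | 10.0 | 100.0% | 0 | | 163.96s | 483 | 9,991 |

| Data parsing and extraction | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3 Flash Preview | 10.0 | 10.0 | 100.0% | 0 | | 1.41s | 279 | 0 |
| Qwen3.5-27B | 10.0 | 10.0 | 100.0% | 0 | | 30.26s | 270 | 16,150 |

| Domain specific | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3 Flash Preview | 7.7 | 10.0 | 66.7% | 0 | | 963ms | 18 | 0 |
| Qwen3.5-27B | 5.3 | 10.0 | 33.3% | 0 | | 79.53s | 43 | 52,368 |

| General Intelligence | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3 Flash Preview | 10.0 | 10.0 | 100.0% | 0 | | 1.13s | 104 | 0 |
| Qwen3.5-27B | 6.1 | 3.1 | 66.7% | 1 | | 101.41s | 70 | 23,147 |

| Instructions following | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3 Flash Preview | 6.4 | 5.8 | 66.7% | 1 | | 1.58s | 74 | 0 |
| Qwen3.5-27B | 10.0 | 10.0 | 100.0% | 0 | | 19.66s | 97 | 11,638 |

| Puzzle Solving | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3 Flash Preview | 7.7 | 10.0 | 66.7% | 0 | | 1.05s | 144 | 0 |
| Qwen3.5-27B | 8.2 | 7.7 | 77.8% | 1 | | 59.60s | 242 | 70,096 |

| Tool Calling | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3 Flash Preview | 10.0 | 10.0 | 100.0% | 0 | | 3.35s | 234 | 0 |
| Qwen3.5-27B | 10.0 | 10.0 | 100.0% | 0 | | 7.45s | 348 | 1,323 |

| Trivia | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3 Flash Preview | 3.0 | 10.0 | 0.0% | 0 | | 1.07s | 15 | 0 |
| Qwen3.5-27B | 3.0 | 10.0 | 0.0% | 0 | | 85.11s | 31 | 23,683 |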
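The per-category rows above can be tallied to see which model leads where, and to cross-check the category figures against the summary totals. This is a rough sketch under stated assumptions: the `rows` list and the `tally` function are hypothetical names, the values are transcribed by hand from the category breakdown, and a "win" here simply means a strictly higher Score, which is not a ranking rule stated by the source.

```python
# (category, Gemini score, Qwen score, Qwen reasoning tokens),
# transcribed from the category breakdown above.
rows = [
    ("Anti-AI Tricks", 8.3, 8.7, 31_505),
    ("Coding", 6.8, 7.0, 64_993),
    ("Combined", 4.7, 10.0, 9_991),
    ("Data parsing and extraction", 10.0, 10.0, 16_150),
    ("Domain specific", 7.7, 5.3, 52_368),
    ("General Intelligence", 10.0, 6.1, 23_147),
    ("Instructions following", 6.4, 10.0, 11_638),
    ("Puzzle Solving", 7.7, 8.2, 70_096),
    ("Tool Calling", 10.0, 10.0, 1_323),
    ("Trivia", 3.0, 3.0, 23_683),
]


def tally(rows):
    """Count categories where each model has the strictly higher score."""
    wins = {"Gemini 3 Flash Preview": 0, "Qwen3.5-27B": 0, "tie": 0}
    for _category, gemini, qwen, _tokens in rows:
        if gemini > qwen:
            wins["Gemini 3 Flash Preview"] += 1
        elif qwen > gemini:
            wins["Qwen3.5-27B"] += 1
        else:
            wins["tie"] += 1
    return wins


wins = tally(rows)
total_reasoning = sum(tokens for *_rest, tokens in rows)

print(wins)             # {'Gemini 3 Flash Preview': 2, 'Qwen3.5-27B': 5, 'tie': 3}
print(total_reasoning)  # 304894, matching the summary's Reasoning Tokens figure
```

By this simple count Qwen3.5-27B leads in five categories, Gemini 3 Flash Preview in two, and three are tied. The per-category reasoning tokens sum to the 304,894 reported in the summary table.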
