Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

Google: Gemini 3.1 Flash Lite vs Qwen: Qwen3.5-35B-A3B

Last updated at: 2026-05-08

Metric Gemini 3.1 Flash Lite Gemini 3.1 Flash Lite minimal Release: 2026-05-08 Qwen3.5-35B-A3B Qwen3.5-35B-A3B medium Release: 2026-02-24
Score 6.8 7.2
Rank #68 #57
Reliability 10.0 6.7
Consistency 8.7 6.9
Tests Correct
Attempt pass rate 59.7% 75.4%
Flaky tests 3 7
Total Runs 57 57
Cost per result 0.111 4.806
Total Cost $0.012 $0.481
Input Price $0.250 / 1M $0.140 / 1M
Output Price $1.500 / 1M $1.000 / 1M
Output Tokens 2,457 21,056
Reasoning Tokens 0 280,814
Response Time (avg) 1.41s 51.50s
Response Time (max) 4.49s 177.35s
Response Time (total) 26.72s 978.57s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 8.3 10.0 75.0% 0 1.10s 639 0
Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 21.13s 798 42,652
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 10.0 10.0 100.0% 0 1.31s 636 0
Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 79.09s 4,273 33,078
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 3.0 10.0 0.0% 0 2.53s 357 0
Qwen3.5-35B-A3B 4.7 1.6 66.7% 1 75.34s 775 12,485
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 10.0 10.0 100.0% 0 1.04s 279 0
Qwen3.5-35B-A3B 7.3 5.9 83.3% 1 59.33s 235 19,493
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 2.9 7.2 11.1% 1 1.02s 15 0
Qwen3.5-35B-A3B 4.1 4.4 44.5% 2 88.34s 41 46,368
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 4.0 10.0 0.0% 0 791ms 63 0
Qwen3.5-35B-A3B 2.8 1.6 33.3% 1 30.30s 20 3,753
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 10.0 10.0 100.0% 0 932ms 72 0
Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 24.45s 97 17,361
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 6.0 4.6 66.7% 2 2.15s 153 0
Qwen3.5-35B-A3B 6.4 4.4 77.8% 2 31.58s 3,589 32,206
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 10.0 10.0 100.0% 0 3.51s 234 0
Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 4.65s 309 1,365
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite 3.0 10.0 0.0% 0 724ms 9 0
Qwen3.5-35B-A3B 3.0 10.0 0.0% 0 177.35s 10,919 72,053

Quick Compare

Switch Comparison Pair