Navigate
AI BENCHY
Your ad here

AI BENCHY Compare

Google: Gemini 3.1 Flash Lite Preview vs Qwen: Qwen3.5-35B-A3B

Last updated at: 2026-03-15

Metric Gemini 3.1 Flash Lite Preview Gemini 3.1 Flash Lite Preview low Release: 2026-03-03 Qwen3.5-35B-A3B Qwen3.5-35B-A3B medium Release: 2026-02-24
Rank #21 #33
Score 7.9 7.1
Consistency 10.0 6.3
Cost per result 0.177 4.251
Total Cost $0.020 $0.341
Tests Correct
Attempt pass rate 68.8% 77.1%
Flaky tests 0 7
Total Runs 48 48
Output Tokens 1,611 5,495
Reasoning Tokens 7,272 169,266
Response Time (avg) 3.36s 43.93s
Response Time (max) 11.91s 106.00s
Response Time (total) 53.84s 702.85s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 7.7 10.0 66.7% 0 2.18s 456 1,224
Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 21.75s 429 36,235
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 3.0 10.0 0.0% 0 11.91s 225 762
Qwen3.5-35B-A3B 4.7 1.6 66.7% 1 75.34s 775 12,485
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 3.00s 291 696
Qwen3.5-35B-A3B 7.3 5.9 83.3% 1 59.33s 235 19,493
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 5.3 10.0 33.3% 0 2.36s 18 1,212
Qwen3.5-35B-A3B 4.1 4.4 44.5% 2 88.34s 41 46,368
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 4.0 10.0 0.0% 0 1.54s 69 384
Qwen3.5-35B-A3B 2.8 1.6 33.3% 1 30.30s 20 3,753
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 1.49s 72 753
Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 24.45s 97 17,361
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 2.76s 243 1,248
Qwen3.5-35B-A3B 6.4 4.4 77.8% 2 31.58s 3,589 32,206
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Gemini 3.1 Flash Lite Preview 10.0 10.0 100.0% 0 9.54s 237 993
Qwen3.5-35B-A3B 10.0 10.0 100.0% 0 4.65s 309 1,365

Quick Compare

Switch Comparison Pair