Gemini 3.1 Flash Lite (low) vs Qwen3.5 Plus 2026-02-15

Recommended model Qwen3.5 Plus 2026-02-15

It has the best score here (6.4), while costing about 8.6x less than Gemini 3.1 Flash Lite (low).

Detailed comparison

Metric	Gemini 3.1 Flash Lite Gemini 3.1 Flash Lite low Release: 2026-05-08	Qwen3.5 Plus 2026-02-15 Qwen3.5 Plus 2026-02-15 none Release: 2026-02-15

Metric	Gemini 3.1 Flash Lite Gemini 3.1 Flash Lite low Release: 2026-05-08	Qwen3.5 Plus 2026-02-15 Qwen3.5 Plus 2026-02-15 none Release: 2026-02-15
Score	6.5	6.4
Rank	#118	#120
Reliability	10.0	10.0
Consistency	9.2	9.4
Tests Correct
Attempt pass rate	59.1%	48.5%
Flaky tests	2	2
Total Runs	66	66
Cost per result	5.170	0.751
Total Cost	$0.621	$0.073
Input Price	$0.250 / 1M	$0.260 / 1M
Output Price	$1.500 / 1M	$1.560 / 1M
Total Input Tokens	94,224	102,646
Output Tokens	7,759	29,370
Reasoning Tokens	390,126	0
Response Time (avg)	16.26s	9.85s
Response Time (max)	318.02s	123.00s
Response Time (total)	357.64s	157.63s

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

low

none

Category:

Anti-AI Tricks	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Gemini 3.1 Flash Lite	7.3	6.2	75.0%	2		1.84s	500	1,013	1,548
Qwen3.5 Plus 2026-02-15	4.8	10.0	25.0%	0		1.91s	696	517	0

Coding	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Gemini 3.1 Flash Lite	5.5	10.0	33.3%	0		1.53s	8,132	471	1,072
Qwen3.5 Plus 2026-02-15	4.3	7.9	11.1%	1		2.05s	7,913	473	0

Combined	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Gemini 3.1 Flash Lite	3.2	9.1	0.0%	0		161.25s	70,202	5,375	381,841
Qwen3.5 Plus 2026-02-15	6.5	10.0	50.0%	0		64.83s	75,086	27,204	0

Data parsing and extraction	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Gemini 3.1 Flash Lite	10.0	10.0	100.0%	0		1.44s	7,453	291	697
Qwen3.5 Plus 2026-02-15	10.0	10.0	100.0%	0		1.89s	7,794	243	0

Domain specific	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Gemini 3.1 Flash Lite	5.3	10.0	33.3%	0		1.52s	639	15	1,214
Qwen3.5 Plus 2026-02-15	5.3	10.0	33.3%	0		1.17s	789	17	0

General Intelligence	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Gemini 3.1 Flash Lite	4.0	10.0	0.0%	0		1.37s	492	69	438
Qwen3.5 Plus 2026-02-15	4.4	3.0	33.3%	1		2.26s	522	117	0

Instructions following	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Gemini 3.1 Flash Lite	10.0	10.0	100.0%	0		1.52s	619	72	760
Qwen3.5 Plus 2026-02-15	10.0	10.0	100.0%	0		1.67s	711	72	0

Puzzle Solving	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Gemini 3.1 Flash Lite	10.0	10.0	100.0%	0		1.40s	570	210	1,191
Qwen3.5 Plus 2026-02-15	7.7	10.0	66.7%	0		2.71s	714	494	0

Tool Calling	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Gemini 3.1 Flash Lite	10.0	10.0	100.0%	0		5.66s	5,457	234	945
Qwen3.5 Plus 2026-02-15	10.0	10.0	100.0%	0		3.33s	8,211	222	0

Trivia	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Gemini 3.1 Flash Lite	3.0	10.0	0.0%	0		1.46s	160	9	420
Qwen3.5 Plus 2026-02-15	3.0	10.0	0.0%	0		1.11s	210	11	0

Switch Comparison Pair