
AI BENCHY Compare

Google: Gemini 3 Flash Preview vs LiquidAI: LFM2-24B-A2B

Last updated at: 2026-05-22

Metric                 Gemini 3 Flash Preview (medium)   LFM2-24B-A2B (none)
Release                2025-12-17                        2026-02-24
Score                  9.8                               4.2
Rank                   #1                                #152
Reliability            10.0                              N/A
Consistency            9.6                               9.0
Tests Correct
Attempt pass rate      98.3%                             18.8%
Flaky tests            1                                 2
Total Runs             60                                48
Cost per result        2.985                             0.024
Total Cost             $0.567                            $0.001
Input Price            $0.500 / 1M                       $0.030 / 1M
Output Price           $3.000 / 1M                       $0.120 / 1M
Output Tokens          2,009                             1,185
Reasoning Tokens       181,315                           0
Response Time (avg)    16.72s                            811ms
Response Time (max)    117.26s                           2.88s
Response Time (total)  334.36s                           11.35s
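
The cost figures above can be roughly cross-checked against the token counts. The sketch below is illustrative only: the helper name `output_side_cost` is hypothetical, and it assumes (the page does not say so) that reasoning tokens are billed at the listed output price. Input tokens are not reported in the table, so only the output side can be estimated.

```python
# Hypothetical helper, not part of AI BENCHY: estimates output-side spend
# from the token counts and output prices listed in the table above.
# Assumption (not stated in the source): reasoning tokens are billed
# at the same per-token rate as output tokens.

def output_side_cost(output_tokens: int, reasoning_tokens: int,
                     price_per_million: float) -> float:
    """Return the estimated dollar cost of output plus reasoning tokens."""
    return (output_tokens + reasoning_tokens) * price_per_million / 1_000_000


# Values copied from the comparison table.
gemini = output_side_cost(2_009, 181_315, 3.000)
lfm2 = output_side_cost(1_185, 0, 0.120)

print(f"Gemini 3 Flash Preview: ${gemini:.3f} (reported total: $0.567)")
print(f"LFM2-24B-A2B: ${lfm2:.4f} (reported total: $0.001)")
```

Under that assumption, about $0.550 of Gemini 3 Flash Preview's reported $0.567 would come from output and reasoning tokens, with the remainder presumably attributable to input tokens. For LFM2-24B-A2B the output-side estimate is about $0.0001, well below the rounded $0.001 total.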

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

Category                     Model                   Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Anti-AI Tricks               Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           3.88s                330            3,216
Anti-AI Tricks               LFM2-24B-A2B            3.3    9.8          0.0%               0                           471ms                490            0
Coding                       Gemini 3 Flash Preview  7.9    6.4          83.3%              1                           95.96s               456            127,964
Coding                       LFM2-24B-A2B            -      -            -                  -            -              -                    -              -
Combined                     Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           22.42s               351            10,485
Combined                     LFM2-24B-A2B            3.0    10.0         0.0%               0                           0ms                  0              0
Data parsing and extraction  Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           5.43s                279            4,893
Data parsing and extraction  LFM2-24B-A2B            3.0    10.0         0.0%               0                           714ms                219            0
Domain specific              Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           15.27s               12             21,684
Domain specific              LFM2-24B-A2B            5.9    7.2          55.6%              1                           287ms                30             0
General Intelligence         Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           5.19s                72             1,905
General Intelligence         LFM2-24B-A2B            4.0    10.0         0.0%               0                           395ms                72             0
Instructions following       Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           4.04s                72             2,709
Instructions following       LFM2-24B-A2B            6.3    10.0         50.0%              0                           1.09s                60             0
Puzzle Solving               Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           5.48s                192            4,647
Puzzle Solving               LFM2-24B-A2B            3.7    7.7          11.1%              1                           1.69s                314            0
Tool Calling                 Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           12.60s               234            1,487
Tool Calling                 LFM2-24B-A2B            3.0    10.0         0.0%               0                           0ms                  0              0
Trivia                       Gemini 3 Flash Preview  10.0   10.0         100.0%             0                           5.50s                11             2,325
Trivia                       LFM2-24B-A2B            -      -            -                  -            -              -                    -              -
