AI BENCHY Compare

Google: Gemini 3.5 Flash vs Hunter Alpha

Last updated at: 2026-05-22

Metric                 Gemini 3.5 Flash (low)  Hunter Alpha (none)
Release                2026-05-19              2026-03-11
Score                  9.3                     5.7
Rank                   #3                      #108
Reliability            10.0                    N/A
Consistency            10.0                    8.2
Tests Correct
Attempt pass rate      90.0%                   46.3%
Flaky tests            0                       4
Total Runs             60                      52
Cost per result        1.582                   0.000
Total Cost             $0.285                  $0.000
Input Price            $1.500 / 1M             $0.000 / 1M
Output Price           $9.000 / 1M             $0.000 / 1M
Output Tokens          2,027                   2,278
Reasoning Tokens       23,938                  0
Response Time (avg)    2.98s                   4.58s
Response Time (max)    6.44s                   15.17s
Response Time (total)  59.59s                  77.92s

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

Category                     Model             Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Anti-AI Tricks               Gemini 3.5 Flash  10.0   10.0         100.0%             0                           2.52s                209            2,536
Anti-AI Tricks               Hunter Alpha      3.5    8.0          16.7%              1                           3.81s                779            0
Coding                       Gemini 3.5 Flash  6.8    10.0         50.0%              0                           5.54s                452            6,839
Coding                       Hunter Alpha      3.0    10.0         0.0%               0                           0ms                  0              0
Combined                     Gemini 3.5 Flash  10.0   10.0         100.0%             0                           6.44s                351            3,050
Combined                     Hunter Alpha      3.0    10.0         0.0%               0                           15.17s               379            0
Data parsing and extraction  Gemini 3.5 Flash  10.0   10.0         100.0%             0                           1.81s                279            1,164
Data parsing and extraction  Hunter Alpha      10.0   10.0         100.0%             0                           8.49s                249            0
Domain specific              Gemini 3.5 Flash  7.7    10.0         66.7%              0                           3.39s                12             4,538
Domain specific              Hunter Alpha      5.3    10.0         33.3%              0                           2.33s                27             0
General Intelligence         Gemini 3.5 Flash  10.0   10.0         100.0%             0                           2.27s                119            916
General Intelligence         Hunter Alpha      6.1    3.1          66.7%              1                           2.71s                91             0
Instructions following       Gemini 3.5 Flash  9.9    10.0         100.0%             0                           1.86s                71             1,652
Instructions following       Hunter Alpha      6.4    10.0         50.0%              0                           2.82s                69             0
Puzzle Solving               Gemini 3.5 Flash  10.0   10.0         100.0%             0                           2.35s                288            2,150
Puzzle Solving               Hunter Alpha      5.8    4.4          66.7%              2                           3.06s                349            0
Tool Calling                 Gemini 3.5 Flash  10.0   10.0         100.0%             0                           3.27s                234            403
Tool Calling                 Hunter Alpha      10.0   10.0         100.0%             0                           6.02s                335            0
Trivia                       Gemini 3.5 Flash  10.0   10.0         100.0%             0                           1.88s                12             690
Trivia                       Hunter Alpha      -      -            -                  -            -              -                    -              -
