Navigate
AI BENCHY
Compare Charts Methodology
❤️ Made by XCS
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Trinity Large Preview vs Google: Gemini 3 Flash Preview

Compare:

Last updated at: 2026-03-06

Metric Trinity Large Preview none Release: 2026-01-27 Free Available Google: Gemini 3 Flash Preview medium Release: 2025-12-17
Rank #45 #1
Avg Score 4.2 10.0
Consistency 9.6 10.0
Cost per result 0.000 1.025
Total Cost $0.000 $0.164
Tests Correct
Attempt pass rate 33.3% 100.0%
Flaky tests 1 0
Total Runs 48 (16 x 3) 48 (16 x 3)
Output Tokens 1,837 1,634
Reasoning Tokens 0 47,907
Response Time (avg) 3.15s 12.36s
Response Time (max) 8.91s 50.16s
Response Time (total) 50.46s 111.21s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Avg Score vs Response Time (avg)

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Trinity Large Preview 10.0 10.0 0.0% 0 3.59s 587 0
Google: Gemini 3 Flash Preview 10.0 10.0 100.0% 0 5.61s 299 3,127
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Trinity Large Preview 10.0 10.0 0.0% 0 8.91s 294 0
Google: Gemini 3 Flash Preview 10.0 10.0 100.0% 0 50.16s 351 12,645
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Trinity Large Preview 9.9 10.0 100.0% 0 3.26s 186 0
Google: Gemini 3 Flash Preview 9.9 10.0 100.0% 0 4.72s 279 5,333
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Trinity Large Preview 4.0 10.0 33.3% 0 877ms 25 0
Google: Gemini 3 Flash Preview 10.0 10.0 100.0% 0 21.12s 12 14,908
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Trinity Large Preview 3.0 9.9 0.0% 0 2.86s 124 0
Google: Gemini 3 Flash Preview 10.0 10.0 100.0% 0 4.09s 111 1,285
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Trinity Large Preview 3.5 6.7 16.7% 1 1.09s 63 0
Google: Gemini 3 Flash Preview 10.0 10.0 100.0% 0 6.10s 72 4,558
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Trinity Large Preview 4.0 10.0 33.3% 0 3.30s 291 0
Google: Gemini 3 Flash Preview 10.0 10.0 100.0% 0 4.43s 276 4,921
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Trinity Large Preview 10.0 10.0 100.0% 0 6.67s 267 0
Google: Gemini 3 Flash Preview 10.0 10.0 100.0% 0 10.55s 234 1,130

Quick Compare

Switch Comparison Pair