❤️ Made by XCS

AI BENCHY Compare

Trinity Large Preview vs OpenAI: GPT-5.2

Last updated at: 2026-03-06

| Metric | Trinity Large Preview (none) | OpenAI: GPT-5.2 (medium) |
| --- | --- | --- |
| Release | 2026-01-27 (Free Available) | 2025-12-11 |
| Rank | #45 | #27 |
| Avg Score | 4.2 | 6.5 |
| Consistency | 9.6 | 7.9 |
| Cost per result | 0.000 | 3.125 |
| Total Cost | $0.000 | $0.313 |
| Tests Correct | | |
| Attempt pass rate | 33.3% | 75.0% |
| Flaky tests | 1 | 4 |
| Total Runs | 48 | 48 |
| Output Tokens | 1,837 | 2,220 |
| Reasoning Tokens | 0 | 16,811 |
| Response Time (avg) | 3.15s | 15.33s |
| Response Time (max) | 8.91s | 77.80s |
| Response Time (total) | 50.46s | 138.01s |

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Avg Score vs Response Time (avg)]

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | Trinity Large Preview | 10.0 | 10.0 | 0.0% | 0 | | 3.59s | 587 | 0 |
| Anti-AI Tricks | OpenAI: GPT-5.2 | 7.0 | 7.3 | 77.8% | 1 | | 14.34s | 549 | 2,002 |
| Combined | Trinity Large Preview | 10.0 | 10.0 | 0.0% | 0 | | 8.91s | 294 | 0 |
| Combined | OpenAI: GPT-5.2 | 10.0 | 10.0 | 100.0% | 0 | | 14.06s | 291 | 1,757 |
| Data parsing and extraction | Trinity Large Preview | 9.9 | 10.0 | 100.0% | 0 | | 3.26s | 186 | 0 |
| Data parsing and extraction | OpenAI: GPT-5.2 | 9.9 | 10.0 | 100.0% | 0 | | 3.15s | 234 | 420 |
| Domain specific | Trinity Large Preview | 4.0 | 10.0 | 33.3% | 0 | | 877ms | 25 | 0 |
| Domain specific | OpenAI: GPT-5.2 | 4.0 | 7.2 | 55.6% | 1 | | 77.80s | 42 | 10,342 |
| General Intelligence | Trinity Large Preview | 3.0 | 9.9 | 0.0% | 0 | | 2.86s | 124 | 0 |
| General Intelligence | OpenAI: GPT-5.2 | 10.0 | 9.7 | 0.0% | 0 | | 4.32s | 162 | 269 |
| Instructions following | Trinity Large Preview | 3.5 | 6.7 | 16.7% | 1 | | 1.09s | 63 | 0 |
| Instructions following | OpenAI: GPT-5.2 | 9.5 | 10.0 | 100.0% | 0 | | 3.12s | 94 | 614 |
| Puzzle Solving | Trinity Large Preview | 4.0 | 10.0 | 33.3% | 0 | | 3.30s | 291 | 0 |
| Puzzle Solving | OpenAI: GPT-5.2 | 7.0 | 7.3 | 77.8% | 1 | | 5.47s | 609 | 938 |
| Tool Calling | Trinity Large Preview | 10.0 | 10.0 | 100.0% | 0 | | 6.67s | 267 | 0 |
| Tool Calling | OpenAI: GPT-5.2 | 10.0 | 1.6 | 66.7% | 1 | | 10.30s | 239 | 469 |
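
The per-category scores above can be compared side by side with a few lines of Python. This is only an illustrative sketch: the score values are copied from the Category Breakdown, while the names `SCORES`, `score_gaps`, and `leaders` are hypothetical helpers and not part of AI BENCHY.

```python
# Category scores copied from the Category Breakdown above.
# Tuple order: (Trinity Large Preview, OpenAI: GPT-5.2).
SCORES = {
    "Anti-AI Tricks": (10.0, 7.0),
    "Combined": (10.0, 10.0),
    "Data parsing and extraction": (9.9, 9.9),
    "Domain specific": (4.0, 4.0),
    "General Intelligence": (3.0, 10.0),
    "Instructions following": (3.5, 9.5),
    "Puzzle Solving": (4.0, 7.0),
    "Tool Calling": (10.0, 10.0),
}


def score_gaps(scores):
    """Return {category: GPT-5.2 score minus Trinity score}, rounded to 1 decimal."""
    return {cat: round(gpt - trinity, 1) for cat, (trinity, gpt) in scores.items()}


def leaders(scores):
    """Return {category: name of the higher-scoring model, or 'tie'}."""
    result = {}
    for cat, (trinity, gpt) in scores.items():
        if trinity > gpt:
            result[cat] = "Trinity Large Preview"
        elif gpt > trinity:
            result[cat] = "OpenAI: GPT-5.2"
        else:
            result[cat] = "tie"
    return result


if __name__ == "__main__":
    gaps = score_gaps(SCORES)
    lead = leaders(SCORES)
    for cat in SCORES:
        # A positive gap means GPT-5.2 scored higher in that category.
        print(f"{cat:30s} {gaps[cat]:+.1f}  {lead[cat]}")
```

With the values listed above, the two models tie on score in four categories, OpenAI: GPT-5.2 leads in three, and Trinity Large Preview leads in one. The sketch compares only the Score column and ignores consistency, pass rate, and response time.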
