AI BENCHY Compare

inclusionAI: Ring-2.6-1T vs OpenAI: GPT-5.4

Last updated at: 2026-06-03

| Metric | Ring-2.6-1T (medium) | GPT-5.4 (none) |
| --- | --- | --- |
| Release | 2026-05-10 | 2026-03-05 |
| Score | 7.0 | 5.6 |
| Rank | #74 | #121 |
| Reliability | 10.0 | 10.0 |
| Consistency | 8.7 | 9.1 |
| Tests Correct | | |
| Attempt pass rate | 63.3% | 38.3% |
| Flaky tests | 3 | 2 |
| Total Runs | 60 | 60 |
| Cost per result | 0.000 | 1.644 |
| Total Cost | $0.033 | $0.116 |
| Input Price | $0.075 / 1M | $2.500 / 1M |
| Output Price | $0.625 / 1M | $15.000 / 1M |
| Total Input Tokens | 35,892 | 31,593 |
| Output Tokens | 21,752 | 2,402 |
| Reasoning Tokens | 42,754 | 0 |
| Response Time (avg) | 61.29s | 1.45s |
| Response Time (max) | 304.19s | 2.95s |
| Response Time (total) | 1164.50s | 29.00s |
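
The attempt pass rates above are rounded percentages. As a rough sketch, they can be turned back into whole attempt counts, assuming the rate is computed over the 60 Total Runs (the page does not state this, so treat it as an assumption). The function name `implied_passes` is hypothetical and used only for illustration.

```python
# Values copied from the summary figures above.
TOTAL_RUNS = 60
ATTEMPT_PASS_RATE = {"Ring-2.6-1T": 63.3, "GPT-5.4": 38.3}  # percent

def implied_passes(rate_percent: float, runs: int) -> int:
    """Nearest whole number of passing attempts for a rounded percentage.

    Assumption: the pass rate is passes / runs. The page does not confirm this.
    """
    return round(rate_percent / 100 * runs)

for model, rate in ATTEMPT_PASS_RATE.items():
    passes = implied_passes(rate, TOTAL_RUNS)
    # Re-derive the percentage from the whole count to compare with the table.
    print(f"{model}: about {passes}/{TOTAL_RUNS} attempts ({passes / TOTAL_RUNS:.1%})")
```

Under that assumption the counts round-trip to the listed percentages, which suggests (but does not prove) that the rate is taken over all 60 runs.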

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Input Tokens | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | Ring-2.6-1T | 10.0 | 10.0 | 100.0% | 0 | | 42.21s | 810 | 3,833 | 4,891 |
| Anti-AI Tricks | GPT-5.4 | 3.2 | 8.0 | 8.3% | 1 | | 1.21s | 606 | 406 | 0 |
| Coding | Ring-2.6-1T | 6.5 | 10.0 | 50.0% | 0 | | 59.65s | 834 | 1,369 | 3,985 |
| Coding | GPT-5.4 | 6.8 | 10.0 | 50.0% | 0 | | 1.99s | 4,686 | 501 | 0 |
| Combined | Ring-2.6-1T | 10.0 | 10.0 | 100.0% | 0 | | 304.19s | 14,823 | 324 | 6,088 |
| Combined | GPT-5.4 | 3.0 | 10.0 | 0.0% | 0 | | 2.89s | 11,019 | 291 | 0 |
| Data parsing and extraction | Ring-2.6-1T | 6.5 | 10.0 | 50.0% | 0 | | 37.36s | 8,046 | 840 | 1,937 |
| Data parsing and extraction | GPT-5.4 | 10.0 | 10.0 | 100.0% | 0 | | 1.04s | 7,140 | 222 | 0 |
| Domain specific | Ring-2.6-1T | 3.5 | 4.4 | 33.3% | 2 | | 64.92s | 873 | 9,744 | 15,013 |
| Domain specific | GPT-5.4 | 5.3 | 7.2 | 44.4% | 1 | | 1.07s | 723 | 50 | 0 |
| General Intelligence | Ring-2.6-1T | 4.1 | 10.0 | 0.0% | 0 | | 58.26s | 561 | 150 | 583 |
| General Intelligence | GPT-5.4 | 4.4 | 9.9 | 0.0% | 0 | | 1.78s | 477 | 184 | 0 |
| Instructions following | Ring-2.6-1T | 9.8 | 10.0 | 100.0% | 0 | | 11.78s | 774 | 266 | 1,831 |
| Instructions following | GPT-5.4 | 6.5 | 10.0 | 50.0% | 0 | | 1.07s | 660 | 81 | 0 |
| Puzzle Solving | Ring-2.6-1T | 5.9 | 7.2 | 55.6% | 1 | | 20.73s | 792 | 697 | 2,479 |
| Puzzle Solving | GPT-5.4 | 5.6 | 9.8 | 33.3% | 0 | | 1.44s | 642 | 381 | 0 |
| Tool Calling | Ring-2.6-1T | 10.0 | 10.0 | 100.0% | 0 | | 104.44s | 8,136 | 234 | 1,531 |
| Tool Calling | GPT-5.4 | 10.0 | 10.0 | 100.0% | 0 | | 2.75s | 5,445 | 246 | 0 |
| Trivia | Ring-2.6-1T | 3.0 | 10.0 | 0.0% | 0 | | 113.91s | 243 | 4,295 | 4,416 |
| Trivia | GPT-5.4 | 3.0 | 10.0 | 0.0% | 0 | | 0.99s | 195 | 40 | 0 |
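
As a sanity check on the breakdown above, the per-category token counts can be summed and compared with the Total Input Tokens, Output Tokens, and Reasoning Tokens reported in the summary. The sketch below is illustrative only: the numbers are copied from this page, and the names `CATEGORY_TOKENS`, `REPORTED_TOTALS`, and `column_sums` are hypothetical.

```python
# Per-category (input, output, reasoning) token counts copied from the
# breakdown above, in page order from Anti-AI Tricks to Trivia.
CATEGORY_TOKENS = {
    "Ring-2.6-1T": [
        (810, 3833, 4891), (834, 1369, 3985), (14823, 324, 6088),
        (8046, 840, 1937), (873, 9744, 15013), (561, 150, 583),
        (774, 266, 1831), (792, 697, 2479), (8136, 234, 1531),
        (243, 4295, 4416),
    ],
    "GPT-5.4": [
        (606, 406, 0), (4686, 501, 0), (11019, 291, 0),
        (7140, 222, 0), (723, 50, 0), (477, 184, 0),
        (660, 81, 0), (642, 381, 0), (5445, 246, 0),
        (195, 40, 0),
    ],
}

# Totals copied from the summary figures (input, output, reasoning).
REPORTED_TOTALS = {
    "Ring-2.6-1T": (35892, 21752, 42754),
    "GPT-5.4": (31593, 2402, 0),
}

def column_sums(rows):
    """Sum each column of a list of equal-length tuples."""
    return tuple(sum(col) for col in zip(*rows))

for model, rows in CATEGORY_TOKENS.items():
    sums = column_sums(rows)
    verdict = "matches" if sums == REPORTED_TOTALS[model] else "differs from"
    print(f"{model}: category sums {sums} {verdict} reported totals")
```

For both models the category sums agree with the reported totals, so the ten categories appear to account for all recorded token usage.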
