Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

inclusionAI: Ring-2.6-1T vs OpenAI: gpt-oss-120b

Summary

Ring-2.6-1T vs gpt-oss-120b benchmark comparison: Ring-2.6-1T leads on average score with 6.8 vs 6.7. gpt-oss-120b has the lower benchmark cost at $0.011 vs $0.033. gpt-oss-120b is faster at 22.28s vs 61.29s, with pass rates of 60.3% vs 52.4%.

Recommended model: gpt-oss-120b - Its score stays close to the best score here (6.7 vs 6.8), while costing about 3.3x less than Ring-2.6-1T.

Last updated at: 2026-07-02

Metric Ring-2.6-1T Ring-2.6-1T medium Release: 2026-05-10 gpt-oss-120b gpt-oss-120b medium Release: 2025-08-05 Free Available
Score 6.8 6.7
Rank #75 #81
Reliability 10.0 10.0
Consistency 8.8 8.0
Tests Correct
Attempt pass rate 60.3% 52.4%
Flaky tests 3 5
Total Runs 63 63
Cost per result 0.000 0.141
Total Cost $0.033 $0.011
Input Price $0.075 / 1M $0.030 / 1M
Output Price $0.625 / 1M $0.150 / 1M
Total Input Tokens 35,892 39,084
Output Tokens 21,752 20,013
Reasoning Tokens 42,754 50,233
Response Time (avg) 61.29s 22.28s
Response Time (max) 304.19s 68.16s
Response Time (total) 1164.50s 311.96s

Generation showcase

Hamster playing table tennis

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

#75 Ring-2.6-1T

medium
Ring-2.6-1T is no longer available as a free model. It has transitioned to a paid model. Continue using it here: https://openrouter.ai/inclusionai/ring-2.6-1t
Cost
$0.000
Time
0.1s
Tokens
0 tok

#81 gpt-oss-120b

medium
Cost
$0.001
Time
26.7s
Tokens
555 tok

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Ring-2.6-1T 10.0 10.0 100.0% 0 42.21s 810 3,833 4,891
gpt-oss-120b 6.7 9.9 50.0% 0 10.21s 1,314 3,518 2,177
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Ring-2.6-1T 5.3 10.0 33.3% 0 59.65s 834 1,369 3,985
gpt-oss-120b 5.9 7.0 55.6% 1 38.37s 7,782 3,365 11,973
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Ring-2.6-1T 10.0 10.0 100.0% 0 304.19s 14,823 324 6,088
gpt-oss-120b 10.0 10.0 100.0% 0 31.18s 11,535 694 5,072
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Ring-2.6-1T 6.5 10.0 50.0% 0 37.36s 8,046 840 1,937
gpt-oss-120b 6.4 5.9 66.7% 1 1.98s 7,476 241 1,114
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Ring-2.6-1T 3.5 4.4 33.3% 2 64.92s 873 9,744 15,013
gpt-oss-120b 2.9 4.4 22.2% 2 50.92s 1,266 6,784 20,606
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Ring-2.6-1T 4.1 10.0 0.0% 0 58.26s 561 150 583
gpt-oss-120b 4.3 10.0 0.0% 0 7.90s 659 107 387
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Ring-2.6-1T 9.8 10.0 100.0% 0 11.78s 774 266 1,831
gpt-oss-120b 9.9 10.0 100.0% 0 7.63s 1,036 126 1,799
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Ring-2.6-1T 5.9 7.2 55.6% 1 20.73s 792 697 2,479
gpt-oss-120b 5.3 7.2 44.4% 1 21.71s 1,190 1,790 2,264
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Ring-2.6-1T 10.0 10.0 100.0% 0 104.44s 8,136 234 1,531
gpt-oss-120b 9.8 10.0 100.0% 0 6.91s 6,514 287 1,083
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Input Tokens Output Tokens Reasoning Tokens
Ring-2.6-1T 3.0 10.0 0.0% 0 113.91s 243 4,295 4,416
gpt-oss-120b 3.0 10.0 0.0% 0 26.51s 312 3,101 3,758

Quick Compare

Switch Comparison Pair