Navigate
AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY Compare

Anthropic: Claude Sonnet 4.6 vs inclusionAI: Ring-2.6-1T

Last updated at: 2026-05-19

Metric Claude Sonnet 4.6 Claude Sonnet 4.6 medium Release: 2026-02-17 Ring-2.6-1T Ring-2.6-1T none Release: 2026-05-10
Score 7.8 7.2
Rank #40 #62
Reliability 10.0 9.8
Consistency 9.6 9.1
Tests Correct
Attempt pass rate 70.2% 62.5%
Flaky tests 1 2
Total Runs 57 57
Cost per result 9.515 0.000
Total Cost $1.237 $0.000
Input Price $3.000 / 1M $0.075 / 1M
Output Price $15.000 / 1M $0.625 / 1M
Output Tokens 45,505 39,954
Reasoning Tokens 28,370 0
Response Time (avg) 14.25s 55.10s
Response Time (max) 46.35s 143.82s
Response Time (total) 156.71s 881.55s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Claude Sonnet 4.6 6.5 10.0 50.0% 0 2.98s 1,046 1,093
Ring-2.6-1T 9.2 8.4 91.7% 1 43.33s 5,575 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Claude Sonnet 4.6 10.0 10.0 100.0% 0 35.76s 6,894 2,097
Ring-2.6-1T 10.0 10.0 100.0% 0 143.82s 5,036 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Claude Sonnet 4.6 10.0 10.0 100.0% 0 46.35s 5,871 3,962
Ring-2.6-1T 0.0 0.0 0.0% 0 0ms 0 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Claude Sonnet 4.6 10.0 10.0 100.0% 0 13.90s 649 742
Ring-2.6-1T 3.0 10.0 0.0% 0 45.87s 1,529 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Claude Sonnet 4.6 2.9 7.2 11.1% 1 0ms 25,790 16,919
Ring-2.6-1T 5.3 7.2 44.4% 1 73.40s 17,728 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Claude Sonnet 4.6 10.0 10.0 100.0% 0 4.94s 256 433
Ring-2.6-1T 4.3 10.0 0.0% 0 15.63s 846 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Claude Sonnet 4.6 10.0 10.0 100.0% 0 2.61s 318 552
Ring-2.6-1T 9.8 10.0 100.0% 0 27.36s 2,004 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Claude Sonnet 4.6 10.0 10.0 100.0% 0 4.80s 589 635
Ring-2.6-1T 7.7 10.0 66.7% 0 31.47s 3,469 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Claude Sonnet 4.6 10.0 10.0 100.0% 0 7.48s 655 351
Ring-2.6-1T 0.0 0.0 0.0% 0 0ms 0 0
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Claude Sonnet 4.6 3.0 10.0 0.0% 0 30.09s 3,437 1,586
Ring-2.6-1T 3.0 10.0 0.0% 0 133.60s 3,767 0

Quick Compare

Switch Comparison Pair