AI BENCHY Compare

inclusionAI: Ring-2.6-1T vs xAI: Grok Build 0.1

Last updated at: 2026-05-21

| Metric | Ring-2.6-1T (none) | Grok Build 0.1 (medium) |
| --- | --- | --- |
| Release | 2026-05-10 | 2026-05-21 |
| Score | 7.2 | 7.8 |
| Rank | #63 | #41 |
| Reliability | 9.8 | 10.0 |
| Consistency | 9.1 | 8.9 |
| Tests Correct | | |
| Attempt pass rate | 62.5% | 71.9% |
| Flaky tests | 2 | 3 |
| Total Runs | 57 | 57 |
| Cost per result | 0.000 | 4.064 |
| Total Cost | $0.000 | $0.488 |
| Input Price | $0.075 / 1M | $1.000 / 1M |
| Output Price | $0.625 / 1M | $2.000 / 1M |
| Output Tokens | 39,954 | 1,947 |
| Reasoning Tokens | 0 | 223,372 |
| Response Time (avg) | 55.10s | 22.28s |
| Response Time (max) | 143.82s | 88.28s |
| Response Time (total) | 881.55s | 423.30s |

Charts (images not captured): Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens.

Category Breakdown

| Category | Model | Score | Consistency | Attempt pass rate | Flaky tests | Tests Correct | Response Time (avg) | Output Tokens | Reasoning Tokens |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Anti-AI Tricks | Ring-2.6-1T | 9.2 | 8.4 | 91.7% | 1 | | 43.33s | 5,575 | 0 |
| Anti-AI Tricks | Grok Build 0.1 | 10.0 | 10.0 | 100.0% | 0 | | 5.46s | 195 | 9,825 |
| Coding | Ring-2.6-1T | 10.0 | 10.0 | 100.0% | 0 | | 143.82s | 5,036 | 0 |
| Coding | Grok Build 0.1 | 7.3 | 3.7 | 66.7% | 1 | | 30.98s | 354 | 17,734 |
| Combined | Ring-2.6-1T | 0.0 | 0.0 | 0.0% | 0 | | 0ms | 0 | 0 |
| Combined | Grok Build 0.1 | 10.0 | 10.0 | 100.0% | 0 | | 30.81s | 231 | 18,779 |
| Data parsing and extraction | Ring-2.6-1T | 3.0 | 10.0 | 0.0% | 0 | | 45.87s | 1,529 | 0 |
| Data parsing and extraction | Grok Build 0.1 | 10.0 | 10.0 | 100.0% | 0 | | 7.76s | 180 | 10,343 |
| Domain specific | Ring-2.6-1T | 5.3 | 7.2 | 44.4% | 1 | | 73.40s | 17,728 | 0 |
| Domain specific | Grok Build 0.1 | 5.3 | 10.0 | 33.3% | 0 | | 77.75s | 501 | 111,807 |
| General Intelligence | Ring-2.6-1T | 4.3 | 10.0 | 0.0% | 0 | | 15.63s | 846 | 0 |
| General Intelligence | Grok Build 0.1 | 3.8 | 2.5 | 33.3% | 1 | | 10.14s | 78 | 5,386 |
| Instructions following | Ring-2.6-1T | 9.8 | 10.0 | 100.0% | 0 | | 27.36s | 2,004 | 0 |
| Instructions following | Grok Build 0.1 | 9.8 | 10.0 | 100.0% | 0 | | 9.62s | 57 | 12,436 |
| Puzzle Solving | Ring-2.6-1T | 7.7 | 10.0 | 66.7% | 0 | | 31.47s | 3,469 | 0 |
| Puzzle Solving | Grok Build 0.1 | 6.2 | 7.5 | 55.6% | 1 | | 8.67s | 161 | 15,476 |
| Tool Calling | Ring-2.6-1T | 0.0 | 0.0 | 0.0% | 0 | | 0ms | 0 | 0 |
| Tool Calling | Grok Build 0.1 | 10.0 | 10.0 | 100.0% | 0 | | 9.40s | 180 | 5,319 |
| Trivia | Ring-2.6-1T | 3.0 | 10.0 | 0.0% | 0 | | 133.60s | 3,767 | 0 |
| Trivia | Grok Build 0.1 | 3.0 | 10.0 | 0.0% | 0 | | 26.07s | 10 | 16,267 |
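
The per-category token counts can be cross-checked against the headline Output Tokens and Reasoning Tokens figures. The sketch below is only a sanity check on the numbers transcribed from this page; the variable and function names are hypothetical and not part of AI BENCHY. It assumes the headline totals are plain sums over the ten categories, which the page does not state explicitly.

```python
# Per-category token counts transcribed from the Category Breakdown figures.
# Category order: Anti-AI Tricks, Coding, Combined, Data parsing and extraction,
# Domain specific, General Intelligence, Instructions following,
# Puzzle Solving, Tool Calling, Trivia.
ring_output_tokens = [5575, 5036, 0, 1529, 17728, 846, 2004, 3469, 0, 3767]
grok_output_tokens = [195, 354, 231, 180, 501, 78, 57, 161, 180, 10]
grok_reasoning_tokens = [9825, 17734, 18779, 10343, 111807,
                         5386, 12436, 15476, 5319, 16267]

# Headline totals from the top-level comparison (hypothetical labels).
checks = {
    "Ring-2.6-1T output tokens": (ring_output_tokens, 39954),
    "Grok Build 0.1 output tokens": (grok_output_tokens, 1947),
    "Grok Build 0.1 reasoning tokens": (grok_reasoning_tokens, 223372),
}


def matches_total(per_category, reported_total):
    """Return True when the category counts add up to the reported total."""
    return sum(per_category) == reported_total


for label, (per_category, reported_total) in checks.items():
    status = "OK" if matches_total(per_category, reported_total) else "MISMATCH"
    print(f"{label}: sum={sum(per_category)} reported={reported_total} {status}")
```

Under that assumption, all three sums agree with the reported totals, which suggests the category rows and the headline figures were read consistently.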
