Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

Ring 2.6 1t vs xAI: Grok 4.20

Last updated at: 2026-05-10

Metric Ring 2.6 1t Ring 2.6 1t none Release: 2026-05-10 Free Available Grok 4.20 Grok 4.20 medium Release: 2026-03-31
Score 7.2 6.9
Rank #57 #68
Reliability 9.8 10.0
Consistency 9.1 8.3
Tests Correct
Attempt pass rate 62.5% 63.2%
Flaky tests 2 4
Total Runs 57 57
Cost per result 0.000 7.559
Total Cost $0.000 $0.756
Input Price $0.000 / 1M $1.250 / 1M
Output Price $0.000 / 1M $2.500 / 1M
Output Tokens 39,954 1,784
Reasoning Tokens 0 128,233
Response Time (avg) 55.10s 14.53s
Response Time (max) 143.82s 63.48s
Response Time (total) 881.55s 276.06s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ring 2.6 1t 9.2 8.4 91.7% 1 43.33s 5,575 0
Grok 4.20 8.2 7.9 83.3% 1 3.95s 287 8,312
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ring 2.6 1t 10.0 10.0 100.0% 0 143.82s 5,036 0
Grok 4.20 4.3 1.1 66.7% 1 24.33s 250 12,804
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ring 2.6 1t 0.0 0.0 0.0% 0 0ms 0 0
Grok 4.20 10.0 10.0 100.0% 0 17.40s 232 9,556
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ring 2.6 1t 3.0 10.0 0.0% 0 45.87s 1,529 0
Grok 4.20 10.0 10.0 100.0% 0 4.17s 180 5,333
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ring 2.6 1t 5.3 7.2 44.4% 1 73.40s 17,728 0
Grok 4.20 5.3 10.0 33.3% 0 27.03s 375 49,339
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ring 2.6 1t 4.3 10.0 0.0% 0 15.63s 846 0
Grok 4.20 3.9 2.6 33.3% 1 24.48s 65 6,440
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ring 2.6 1t 9.8 10.0 100.0% 0 27.36s 2,004 0
Grok 4.20 7.3 6.0 83.3% 1 4.42s 40 5,474
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ring 2.6 1t 7.7 10.0 66.7% 0 31.47s 3,469 0
Grok 4.20 7.7 10.0 66.7% 0 6.20s 149 7,913
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ring 2.6 1t 0.0 0.0 0.0% 0 0ms 0 0
Grok 4.20 3.0 10.0 0.0% 0 13.68s 197 6,620
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ring 2.6 1t 3.0 10.0 0.0% 0 133.60s 3,767 0
Grok 4.20 3.0 10.0 0.0% 0 63.48s 9 16,442

Quick Compare

Switch Comparison Pair