Navigate
AI BENCHY
Advertise here

AI BENCHY Compare

inclusionAI: Ling-2.6-flash vs OpenAI: GPT-5.4

Last updated at: 2026-05-22

Metric Ling-2.6-flash Ling-2.6-flash none Release: 2026-04-21 GPT-5.4 GPT-5.4 none Release: 2026-03-05
Score 5.3 5.6
Rank #128 #112
Reliability 10.0 10.0
Consistency 9.2 9.1
Tests Correct
Attempt pass rate 35.1% 38.3%
Flaky tests 2 2
Total Runs 60 60
Cost per result 0.005 1.638
Total Cost $0.001 $0.115
Input Price $0.010 / 1M $2.500 / 1M
Output Price $0.030 / 1M $15.000 / 1M
Output Tokens 2,878 2,378
Reasoning Tokens 0 0
Response Time (avg) 9.76s 1.46s
Response Time (max) 35.34s 2.95s
Response Time (total) 185.37s 29.23s

Top Models by Score

Score vs Total Cost

Response Time (avg)

Score vs Response Time (avg)

Total Output Tokens

Score vs Total Output Tokens

Category Breakdown

Anti-AI Tricks Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ling-2.6-flash 6.8 8.1 58.3% 1 11.81s 573 0
GPT-5.4 3.2 8.0 8.3% 1 1.21s 406 0
Coding Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ling-2.6-flash 10.0 10.0 100.0% 0 11.21s 381 0
GPT-5.4 6.8 10.0 50.0% 0 1.99s 501 0
Combined Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ling-2.6-flash 3.0 10.0 0.0% 0 35.34s 1,069 0
GPT-5.4 3.0 10.0 0.0% 0 2.89s 291 0
Data parsing and extraction Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ling-2.6-flash 6.5 10.0 50.0% 0 8.48s 246 0
GPT-5.4 10.0 10.0 100.0% 0 1.04s 222 0
Domain specific Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ling-2.6-flash 3.0 10.0 0.0% 0 4.95s 24 0
GPT-5.4 5.3 7.2 44.4% 1 1.07s 50 0
General Intelligence Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ling-2.6-flash 4.0 10.0 0.0% 0 1.45s 109 0
GPT-5.4 4.4 9.9 0.0% 0 1.78s 184 0
Instructions following Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ling-2.6-flash 9.8 10.0 100.0% 0 5.52s 81 0
GPT-5.4 6.5 10.0 50.0% 0 1.07s 81 0
Puzzle Solving Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ling-2.6-flash 2.9 7.2 11.1% 1 9.14s 151 0
GPT-5.4 5.6 9.8 33.3% 0 1.52s 357 0
Tool Calling Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ling-2.6-flash 3.0 10.0 0.0% 0 18.80s 229 0
GPT-5.4 10.0 10.0 100.0% 0 2.75s 246 0
Trivia Score Consistency Attempt pass rate Flaky tests Tests Correct Response Time (avg) Output Tokens Reasoning Tokens
Ling-2.6-flash 3.0 10.0 0.0% 0 1.06s 15 0
GPT-5.4 3.0 10.0 0.0% 0 990ms 40 0

Quick Compare

Switch Comparison Pair