
AI BENCHY Compare

OpenAI: GPT-5 Nano vs Owl Alpha

Last updated at: 2026-04-30

Metric                 GPT-5 Nano (medium)  Owl Alpha (none)
Release                2025-08-07           2026-04-30
Score                  6.4                  6.0
Rank                   #73                  #87
Reliability            N/A                  10.0
Consistency            6.8                  9.1
Tests Correct
Attempt pass rate      61.1%                46.3%
Flaky tests            7                    2
Total Runs             54                   54
Cost per result        0.824                0.000
Total Cost             $0.066               $0.000
Input Price            $0.050 / 1M          $0.000 / 1M
Output Price           $0.400 / 1M          $0.000 / 1M
Output Tokens          4,980                1,671
Reasoning Tokens       156,288              0
Response Time (avg)    44.13s               7.07s
Response Time (max)    204.02s              32.27s
Response Time (total)  485.47s              127.23s
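
The cost rows can be roughly cross-checked against the token counts and prices listed above. The sketch below is only an estimate under two assumptions that the page does not state: reasoning tokens are billed at the output price, and input tokens (which are not reported here) are left out. The function name `output_cost_usd` is illustrative.

```python
def output_cost_usd(output_tokens: int, reasoning_tokens: int,
                    price_per_million: float) -> float:
    """Estimate output-side cost in USD.

    Assumption: reasoning tokens are billed at the same rate as output tokens.
    """
    billed_tokens = output_tokens + reasoning_tokens
    return billed_tokens * price_per_million / 1_000_000


# Token counts and output prices copied from the summary table.
gpt5_nano = output_cost_usd(4_980, 156_288, 0.400)
owl_alpha = output_cost_usd(1_671, 0, 0.000)

print(f"GPT-5 Nano output-side cost: ${gpt5_nano:.4f}")
print(f"Owl Alpha output-side cost:  ${owl_alpha:.4f}")
```

Under these assumptions the GPT-5 Nano estimate comes to about $0.0645, slightly below the listed Total Cost of $0.066; the difference would be accounted for by input tokens if the assumptions hold.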

[Charts: Top Models by Score; Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens]

Category Breakdown

Category                     Model       Score  Consistency  Attempt pass rate  Flaky tests  Tests Correct  Response Time (avg)  Output Tokens  Reasoning Tokens
Anti-AI Tricks               GPT-5 Nano  6.5    7.9          58.3%              1                           25.50s               1,221          21,184
Anti-AI Tricks               Owl Alpha   3.4    7.9          16.7%              1                           2.78s                57             0
Coding                       GPT-5 Nano  6.7    3.5          66.7%              1                           40.73s               480            12,992
Coding                       Owl Alpha   10.0   10.0         100.0%             0                           32.27s               450            0
Combined                     GPT-5 Nano  10.0   10.0         100.0%             0                           65.96s               578            17,984
Combined                     Owl Alpha   3.0    10.0         0.0%               0                           21.74s               315            0
Data parsing and extraction  GPT-5 Nano  3.7    1.7          50.0%              2                           21.42s               453            10,560
Data parsing and extraction  Owl Alpha   10.0   10.0         100.0%             0                           3.60s                246            0
Domain specific              GPT-5 Nano  5.2    4.4          55.6%              2                           204.02s              237            64,448
Domain specific              Owl Alpha   5.3    10.0         33.3%              0                           3.00s                27             0
General Intelligence         GPT-5 Nano  4.1    10.0         0.0%               0                           17.51s               202            4,608
General Intelligence         Owl Alpha   4.3    10.0         0.0%               0                           4.61s                80             0
Instructions following       GPT-5 Nano  9.8    10.0         100.0%             0                           11.90s               382            4,096
Instructions following       Owl Alpha   6.4    10.0         50.0%              0                           2.63s                63             0
Puzzle Solving               GPT-5 Nano  5.3    7.2          44.4%              1                           19.81s               869            13,440
Puzzle Solving               Owl Alpha   5.9    7.2          55.6%              1                           4.43s                202            0
Tool Calling                 GPT-5 Nano  10.0   10.0         100.0%             0                           33.30s               558            6,976
Tool Calling                 Owl Alpha   10.0   10.0         100.0%             0                           22.78s               231            0
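
As a consistency check, the per-category token counts can be summed and compared with the Output Tokens and Reasoning Tokens totals in the summary table. The following is a minimal sketch with the figures copied by hand from this page; the names `CATEGORY_TOKENS`, `REPORTED_TOTALS`, and `sum_tokens` are illustrative and not part of any AI BENCHY tooling.

```python
# Per-category (output_tokens, reasoning_tokens), copied from the Category Breakdown.
CATEGORY_TOKENS = {
    "GPT-5 Nano": {
        "Anti-AI Tricks": (1_221, 21_184),
        "Coding": (480, 12_992),
        "Combined": (578, 17_984),
        "Data parsing and extraction": (453, 10_560),
        "Domain specific": (237, 64_448),
        "General Intelligence": (202, 4_608),
        "Instructions following": (382, 4_096),
        "Puzzle Solving": (869, 13_440),
        "Tool Calling": (558, 6_976),
    },
    "Owl Alpha": {
        "Anti-AI Tricks": (57, 0),
        "Coding": (450, 0),
        "Combined": (315, 0),
        "Data parsing and extraction": (246, 0),
        "Domain specific": (27, 0),
        "General Intelligence": (80, 0),
        "Instructions following": (63, 0),
        "Puzzle Solving": (202, 0),
        "Tool Calling": (231, 0),
    },
}

# (output_tokens, reasoning_tokens) totals from the summary table.
REPORTED_TOTALS = {"GPT-5 Nano": (4_980, 156_288), "Owl Alpha": (1_671, 0)}


def sum_tokens(per_category: dict) -> tuple:
    """Return (total output tokens, total reasoning tokens) across categories."""
    output = sum(out for out, _ in per_category.values())
    reasoning = sum(reason for _, reason in per_category.values())
    return output, reasoning


for model, per_category in CATEGORY_TOKENS.items():
    totals = sum_tokens(per_category)
    status = "matches" if totals == REPORTED_TOTALS[model] else "differs from"
    print(f"{model}: output={totals[0]:,} reasoning={totals[1]:,} ({status} summary)")
```

For both models the category sums agree with the reported totals, which suggests the summary token rows are plain sums over the nine categories.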
