AI BENCHY Compare
Cobuddy vs OpenAI: GPT-5.4
Last updated: 2026-05-06
| Metric | Cobuddy (medium, Free Available) | GPT-5.4 (none) |
|---|---|---|
| Score | 6.0 | 5.9 |
| Rank | #90 | #92 |
| Reliability | 9.9 | N/A |
| Consistency | 6.7 | 9.1 |
| Tests Correct | | |
| Attempt pass rate | 57.4% | 42.6% |
| Flaky tests | 7 | 2 |
| Total Runs | 54 | 54 |
| Cost per result | 0.000 | 1.477 |
| Total Cost | $0.000 | $0.104 |
| Input Price | $0.000 / 1M | $2.500 / 1M |
| Output Price | $0.000 / 1M | $15.000 / 1M |
| Output Tokens | 1,639 | 2,317 |
| Reasoning Tokens | 89,199 | 0 |
| Response Time (avg) | 36.47s | 1.51s |
| Response Time (max) | 309.02s | 2.95s |
| Response Time (total) | 656.47s | 27.21s |
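
As a rough sanity check on the table, the attempt pass rates can be converted back into whole-attempt counts. The sketch below assumes that "Attempt pass rate" means passing attempts divided by "Total Runs"; the page does not define the metric, so treat that as an assumption. The helper names `implied_passes` and `rate_from_passes` are hypothetical and exist only for this illustration.

```python
# Values copied from the comparison table; the formula itself is an assumption
# (pass rate = passing attempts / total runs), not something the page states.
TOTAL_RUNS = 54
PASS_RATES = {"Cobuddy": 57.4, "GPT-5.4": 42.6}  # percent, as published


def implied_passes(pass_rate_percent: float, total_runs: int) -> int:
    """Whole number of passing attempts implied by a rounded percentage."""
    return round(pass_rate_percent / 100 * total_runs)


def rate_from_passes(passes: int, total_runs: int) -> float:
    """Pass rate in percent, rounded to one decimal like the table."""
    return round(passes / total_runs * 100, 1)


for model, rate in PASS_RATES.items():
    passes = implied_passes(rate, TOTAL_RUNS)
    # Recompute the rate from the whole count to confirm it matches the table.
    check = rate_from_passes(passes, TOTAL_RUNS)
    print(f"{model}: about {passes}/{TOTAL_RUNS} attempts passed ({check}%)")
```

Under that assumption the published rates correspond to 31 and 23 passing attempts out of 54, and both counts round back to the percentages shown in the table.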
Charts (not reproduced here): Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens; Category Breakdown.