AI BENCHY Compare
Cobuddy vs xAI: Grok 4.20
Last updated at: 2026-05-10
| Metric | Cobuddy (medium, Free Available) | Grok 4.20 (none) |
|---|---|---|
| Score | 5.8 | 5.4 |
| Rank | #97 | #115 |
| Reliability | 9.9 | N/A |
| Consistency | 6.9 | 9.5 |
| Tests Correct | | |
| Attempt pass rate | 54.4% | 35.2% |
| Flaky tests | 7 | 1 |
| Total Runs | 57 | 54 |
| Cost per result | 0.000 | 1.574 |
| Total Cost | $0.000 | $0.095 |
| Input Price | $0.000 / 1M | $1.250 / 1M |
| Output Price | $0.000 / 1M | $2.500 / 1M |
| Output Tokens | 1,648 | 1,967 |
| Reasoning Tokens | 96,062 | 0 |
| Response Time (avg) | 36.50s | 1.11s |
| Response Time (max) | 309.02s | 6.04s |
| Response Time (total) | 693.45s | 20.02s |

Charts (not reproduced here): Score vs Total Cost; Response Time (avg); Score vs Response Time (avg); Total Output Tokens; Score vs Total Output Tokens; Category Breakdown.