AI BENCHY Compare
Elephant vs Elephant
Last updated at: 2026-04-14
| Metric | Elephant Elephant medium | Elephant Elephant none |
|---|---|---|
| Score | 5.2 | 5.2 |
| Rank | #77 | #81 |
| Consistency | 9.6 | 9.6 |
| Tests Correct | ||
| Attempt pass rate | 29.6% | 31.5% |
| Flaky tests | 1 | 1 |
| Total Runs | 54 | 54 |
| Cost per result | 0.000 | 0.000 |
| Total Cost | $0.000 | $0.000 |
| Input Price | $0.000 / 1M | $0.000 / 1M |
| Output Price | $0.000 / 1M | $0.000 / 1M |
| Output Tokens | 2,596 | 2,573 |
| Reasoning Tokens | 0 | 0 |
| Response Time (avg) | 1.27s | 1.23s |
| Response Time (max) | 3.70s | 3.81s |
| Response Time (total) | 22.82s | 22.16s |
Score vs Total Cost
Response Time (avg)
Score vs Response Time (avg)
Total Output Tokens
Score vs Total Output Tokens
Category Breakdown
Quick Compare
Switch Comparison Pair
ElephantmediumvsGrok 4.20noneMistral Small 4nonevsElephantmediumgpt-oss-120bnoneFree AvailablevsElephantmediumMiniMax M2.7mediumvsElephantnoneTrinity Large PreviewnoneFree AvailablevsElephantmediumGPT-5.4 MininonevsElephantmediumElephantmediumvsQwen3 Coder NextnoneNemotron 3 SupernoneFree AvailablevsElephantmediumElephantmediumvsGLM 5 TurbononeKimi K2.5nonevsElephantmediumElephantmediumvsGLM 5.1noneElephantmediumvsGLM 4.7 Flashnone