#35
Openrouter ยท Release: Unknown release date ยท openrouter/hunter-alpha::medium
Flaky tests
5
Flaky tests had mixed outcomes across runs (at least one pass and one fail).
Charts
Choose the first model, then click a second model to open a side-by-side page.
Avg Score vs Total Cost
Response Time (avg)
Avg Score vs Response Time (avg)
Total Output Tokens
Avg Score vs Total Output Tokens
Quick Compare
Hunter AlphamediumvsGLM 5noneHunter AlphamediumvsGrok 4.1 FastmediumHunter AlphamediumvsNemotron 3 Super 120b A12bmediumFree AvailableHunter AlphamediumvsQwen3.5 Plus 2026-02-15noneHunter AlphamediumvsDeepSeek V3.2noneHunter AlphamediumvsGemini 3 Flash PreviewmediumHunter AlphamediumvsGemini 3.1 Pro PreviewmediumHunter AlphamediumvsStep 3.5 FlashmediumFree Available
Category Breakdown
| Category | Avg Score | Consistency | Tests Correct |
|---|---|---|---|
| Anti-AI Tricks | 7.0 | 7.2 | |
| Combined | 10.0 | 1.6 | |
| Data parsing and extraction | 9.9 | 10.0 | |
| Domain specific | 10.0 | 10.0 | |
| General Intelligence | 8.0 | 3.7 | |
| Instructions following | 9.5 | 10.0 | |
| Puzzle Solving | 4.3 | 4.7 | |
| Tool Calling | 10.0 | 10.0 |