Input Price
$1.200 / 1M
Output Price
$4.000 / 1M
Flaky tests
6
Flaky tests had mixed outcomes across runs (at least one pass and one fail).
Charts
Choose the first model, then click a second model to open a side-by-side page.
Score vs Total Cost
Response Time (avg)
Score vs Response Time (avg)
Total Output Tokens
Score vs Total Output Tokens
Quick Compare
GLM 5V TurbomediumvsGemini 3.1 Flash Lite PreviewnoneGLM 5V TurbomediumvsKimi K2.6mediumGLM 5V TurbomediumvsStep 3.5 FlashmediumGLM 5V TurbomediumvsGemma 4 26B A4BmediumFree AvailableGLM 5V TurbomediumvsGemini 3.1 Flash LitelowGLM 5V TurbomediumvsGemini 3 Flash PreviewmediumGLM 5V TurbomediumvsGemini 3.1 Pro PreviewmediumGLM 5V TurbomediumvsRing 2.6 1tmediumFree Available
Category Breakdown
| Category | Score | Consistency | Tests Correct |
|---|---|---|---|
| Anti-AI Tricks | 7.2 | 6.1 | |
| Coding | 10.0 | 10.0 | |
| Combined | 6.9 | 3.8 | |
| Data parsing and extraction | 10.0 | 10.0 | |
| Domain specific | 5.3 | 7.2 | |
| General Intelligence | 10.0 | 10.0 | |
| Instructions following | 9.9 | 10.0 | |
| Puzzle Solving | 7.6 | 7.2 | |
| Tool Calling | 7.0 | 3.7 | |
| Trivia | 3.0 | 10.0 |