#89
DeepSeek · Release: 2026-04-24 · Tested on: 2026-04-24 09:19 · deepseek/deepseek-v4-flash::none
(high)
(none)
Input Price
$0.140 / 1M
Output Price
$0.280 / 1M
Flaky tests
2
Flaky tests had mixed outcomes across runs (at least one pass and one fail).
Run history
Charts
Choose the first model, then click a second model to open a side-by-side page.
Score vs Total Cost
Response Time (avg)
Score vs Response Time (avg)
Total Output Tokens
Score vs Total Output Tokens
Quick Compare
DeepSeek V4 FlashnonevsGLM 4.7 FlashnoneDeepSeek V4 FlashnonevsKimi K2.5noneDeepSeek V4 FlashnonevsMistral Small 4mediumDeepSeek V4 FlashnonevsGLM 5 TurbononeDeepSeek V4 FlashnonevsLing 2.6 FlashnoneFree AvailableDeepSeek V4 FlashnonevsGemini 3 Flash PreviewmediumDeepSeek V4 FlashnonevsGemini 3.1 Pro PreviewmediumDeepSeek V4 FlashnonevsHY3 PreviewhighFree Available
Category Breakdown
| Category | Score | Consistency | Tests Correct |
|---|---|---|---|
| Anti-AI Tricks | 3.0 | 10.0 | |
| Coding | 6.3 | 10.0 | |
| Combined | 4.5 | 2.1 | |
| Data parsing and extraction | 10.0 | 10.0 | |
| Domain specific | 5.3 | 10.0 | |
| General Intelligence | 4.2 | 9.9 | |
| Instructions following | 6.5 | 10.0 | |
| Puzzle Solving | 3.1 | 7.3 | |
| Tool Calling | 10.0 | 10.0 |