---
title: "DeepSeek V4 Flash"
route_path: "/model/deepseek-v4-flash"
canonical_url: "https://www.pipellm.ai/model/deepseek-v4-flash"
markdown_path: "/llms/models/deepseek-v4-flash.md"
markdown_url: "https://www.pipellm.ai/llms/models/deepseek-v4-flash.md"
content_type: "model-detail-page"
description: "Machine-readable detail page for DeepSeek V4 Flash."
generated_at: "2026-05-29T04:11:23.834Z"
---
- Canonical page: https://www.pipellm.ai/model/deepseek-v4-flash
- Markdown mirror: https://www.pipellm.ai/llms/models/deepseek-v4-flash.md
- Content type: model-detail-page
- Generated at: 2026-05-29T04:11:23.834Z
# DeepSeek V4 Flash
## Query Intents
- Understand pricing, provider availability, context window, and capabilities for DeepSeek V4 Flash.
- Compare DeepSeek V4 Flash against other models available through PipeLLM.
- Find the canonical model identifier to use in SDK or API requests.
## Overview
DeepSeek V4 Flash is an efficiency-focused Mixture-of-Experts model built for fast inference, high-throughput applications, coding assistants, chat systems, and agent workflows. It keeps strong reasoning and coding performance while prioritizing responsiveness and cost efficiency.
## Model Metadata
- Display name: DeepSeek V4 Flash
- Model ID: deepseek-v4-flash
- Provider family: DeepSeek
- Release date: 2026-04-24T03:17:46.000Z
- Context window: 1048.6K tokens
- Max output: 16.4K tokens
- Input modalities: text
- Output modalities: text
- Tool use support: Yes
- Computer use support: No
- Cache control support: No
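
The limits above can be turned into a simple pre-flight check before sending a request. The following is a minimal sketch, not part of any PipeLLM SDK: the function name `fits_in_context` is hypothetical, the constants are the rounded figures shown on this page (the exact limits may differ slightly), and it assumes the context window covers prompt and output tokens together.

```python
# Rounded values as displayed in the metadata list; exact limits may differ.
CONTEXT_WINDOW_TOKENS = 1_048_600  # "1048.6K"
MAX_OUTPUT_TOKENS = 16_400         # "16.4K"


def fits_in_context(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within the listed limits.

    Assumption: the context window is shared by prompt and output tokens.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # The requested output may not exceed the per-response cap.
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    # Prompt plus output must fit inside the context window.
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS
```

Token counts depend on the tokenizer, so a check like this is only as accurate as the count passed in.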
## Official Pricing (per 1M tokens)
| Metric | <=200K Context | >200K Context |
| --- | --- | --- |
| Input Price | $0.1 | — |
| Output Price | $0.2 | — |
| Cache Read | $0.02 | — |
| Image Input | $0 | — |
| Image Output | $0 | — |
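
As a worked example of the per-1M-token rates in the table, the sketch below estimates the cost of a single request. The function name `estimate_cost` is hypothetical, and it assumes that cached tokens are a subset of the input tokens billed at the cache-read rate instead of the input rate; actual billing rules may differ.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing table above.
PRICE_PER_MILLION = {
    "input": Decimal("0.10"),
    "output": Decimal("0.20"),
    "cache_read": Decimal("0.02"),
}
TOKENS_PER_PRICING_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request.

    Assumption: cached_tokens counts input tokens served from cache,
    billed at the cache-read rate rather than the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / TOKENS_PER_PRICING_UNIT


# 200K input tokens (150K of them cached) and 10K output tokens.
print(estimate_cost(200_000, 10_000, cached_tokens=150_000))
```

`Decimal` is used here so that small per-token amounts do not pick up binary floating-point rounding noise.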

## Provider Availability
| Provider | Region | Context Window | Max Output | Input Price | Output Price | Cache Read | Cache Write |
| --- | --- | --- | --- | --- | --- | --- | --- |
| DeepInfra | — | 1048.6K | 16.4K | $0.1 | $0.2 | $0.02 | — |