Context Window
262,144
Input tokens
Full-context input ≈ $0.02
Model Cost Profile
Developer: xiaomi· Tokenizer: Other · Quantization: fp8
Canonical ID: xiaomi/mimo-v2-flash-20251210
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.0900
Output: $0.2900
Last synced Apr 24, 2026 · MMLU score via public benchmark data
The Xiaomi MiMo-V2-Flash model offers an extensive context window of 262,144 tokens, making it suitable for applications requiring in-depth analysis and long-form content generation. With an input price of $0.09 per million tokens and an output price of $0.29 per million tokens, teams can effectively manage costs while leveraging the model for tasks such as document summarization, conversational agents, and complex data processing. This pricing structure allows organizations to scale their usage based on project needs, optimizing budget allocation for API consumption.
Context Window
262,144
Input tokens
Full-context input ≈ $0.02
Max Output
65,536
Completion tokens
Input Price / 1M
$0.0900
Prompt tokens
Output Price / 1M
$0.2900
Completion tokens
Top Benchmark
74.4
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Xiaomi: MiMo-V2-Flash. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.0900
Current Output / 1M
$0.2900
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
75.1%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0900 |
| Output (Completion) | $0.2900 |
| Cache Read | $0.0450 |
Estimate monthly spend for Xiaomi: MiMo-V2-Flash based on your workload.
Estimated Monthly Cost
$5.73
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.