Model Cost Profile

Xiaomi: MiMo-V2-Flash

Developer: xiaomi· Tokenizer: Other · Quantization: fp8

Canonical ID: xiaomi/mimo-v2-flash-20251210

Pricing updated Apr 24, 2026

Input rank: #79Output rank: #80

Live Pricing

Input: $0.0900

Output: $0.2900

HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

The Xiaomi MiMo-V2-Flash model offers an extensive context window of 262,144 tokens, making it suitable for applications requiring in-depth analysis and long-form content generation. With an input price of $0.09 per million tokens and an output price of $0.29 per million tokens, teams can effectively manage costs while leveraging the model for tasks such as document summarization, conversational agents, and complex data processing. This pricing structure allows organizations to scale their usage based on project needs, optimizing budget allocation for API consumption.

💡 Enable prompt caching to save 50% on repeated input tokens ($0.0450/M cached vs $0.0900/M standard).

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Context Window

262,144

Input tokens

Full-context input ≈ $0.02

Max Output

65,536

Completion tokens

Input Price / 1M

$0.0900

Prompt tokens

Output Price / 1M

$0.2900

Completion tokens

Top Benchmark

74.4

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Xiaomi: MiMo-V2-Flash. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	65.6	#55 of 125	artificial_analysis
MMLU	74.4	#74 of 127	artificial_analysis

Price History

Xiaomi: MiMo-V2-Flash Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.0900

Current Output / 1M

$0.2900

Performance History

Xiaomi: MiMo-V2-Flash Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

75.1%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0900
Output (Completion)	$0.2900
Cache Read	$0.0450

Compare with Xiaomi: MiMo-V2-Omni Compare with NVIDIA: Nemotron 3 Super Compare with Qwen: Qwen3 30B A3B Instruct 2507

Cost Calculator

Estimate monthly spend for Xiaomi: MiMo-V2-Flash based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$5.73

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$5.73 Free Models Router$0.00−$5.73 Google: Gemma 3 12B (free)$0.00−$5.73 Google: Gemma 3 27B (free)$0.00−$5.73

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Xiaomi: MiMo-V2-Flash vs Baidu: Qianfan-OCR-Fast (free)Xiaomi: MiMo-V2-Flash vs Free Models Router Xiaomi: MiMo-V2-Flash vs Google: Gemma 3 12B (free)Xiaomi: MiMo-V2-Flash vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

65.6

#55 of 125

artificial_analysis

MMLU

74.4

#74 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0900

Output (Completion)

$0.2900

Cache Read

$0.0450

Cost Calculator

Estimate monthly spend for Xiaomi: MiMo-V2-Flash based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$5.73

25M input + 12M output tokens