Model Cost Profile
Developer: xiaomi · Tokenizer: Other · Quantization: fp8
Canonical ID: xiaomi/mimo-v2-omni-20260318
Pricing updated May 3, 2026
MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...
| Metric | Value | Notes |
|---|---|---|
| Context Window | 262,144 | Input tokens; full-context input ≈ $0.10 |
| Max Output | 65,536 | Completion tokens |
| Input Price / 1M | $0.4000 | Prompt tokens |
| Output Price / 1M | $2.00 | Completion tokens |
| Top Benchmark | Pending | No benchmark data yet |
Price History
Current Input / 1M
$0.4000
Current Output / 1M
$2.00
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.4000 |
| Output (Completion) | $2.00 |
| Cache Read | $0.0800 |
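The per-million prices in the table translate into a per-request cost with simple arithmetic. The sketch below is illustrative only: the function name `request_cost` and the `cached_tokens` parameter are hypothetical, and it assumes that cached prompt tokens are billed at the cache-read rate instead of the input rate, which the page does not spell out.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the pricing table.
PRICE_PER_MILLION = {
    "input": Decimal("0.40"),
    "output": Decimal("2.00"),
    "cache_read": Decimal("0.08"),
}


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request.

    Assumption (not stated on the page): cached_tokens is the portion of
    input_tokens served from cache, billed at the cache-read rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / 1_000_000


# Filling the whole 262,144-token context with uncached input:
print(f"${request_cost(262_144, 0):.4f}")  # $0.1049
```

Under these assumptions a full 262,144-token prompt costs about $0.105, in line with the roughly $0.10 full-context input figure listed with the context window.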
Estimate monthly spend for Xiaomi: MiMo-V2-Omni based on your workload.
Estimated Monthly Cost
$34
25M input + 12M output tokens
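The $34 figure can be reproduced from the listed prices. The following is a minimal sketch, assuming no cache reads and a flat per-token rate; `monthly_cost` is a hypothetical helper name, not part of any provider SDK.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens.
INPUT_PRICE = Decimal("0.40")
OUTPUT_PRICE = Decimal("2.00")


def monthly_cost(input_tokens: int, output_tokens: int) -> dict:
    """Break a monthly token volume into input, output, and total USD cost."""
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) / million * INPUT_PRICE
    output_cost = Decimal(output_tokens) / million * OUTPUT_PRICE
    return {
        "input": input_cost,
        "output": output_cost,
        "total": input_cost + output_cost,
    }


estimate = monthly_cost(25_000_000, 12_000_000)
print(
    f"input ${estimate['input']:.2f} + output ${estimate['output']:.2f} "
    f"= ${estimate['total']:.2f}"
)
# input $10.00 + output $24.00 = $34.00
```

In this workload, output tokens account for $24 of the $34 even though they are fewer than half the input volume, so trimming completion length has the larger effect on spend.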