Context Window
131,072
Input tokens
Full-context input ≈ $0.07
Model Cost Profile
Developer: moonshotai· Tokenizer: Other · Quantization: fp8
Canonical ID: moonshotai/kimi-k2
Pricing updated Apr 22, 2026
Live Pricing
Input: $0.5700
Output: $2.30
Last synced Apr 22, 2026 · MMLU score via public benchmark data
MoonshotAI's Kimi K2 0711 model offers a substantial context window of 131072 tokens, making it ideal for complex applications such as long-form content generation, extensive data analysis, and in-depth conversational agents. With an input pricing of $0.50 per million tokens and an output cost of $2.40 per million tokens, teams can effectively budget for projects that require significant text processing and generation capabilities. This pricing structure allows organizations to scale their usage based on specific needs, optimizing costs for both small and large-scale deployments.
Context Window
131,072
Input tokens
Full-context input ≈ $0.07
Max Output
32,768
Completion tokens
Input Price / 1M
$0.5700
Prompt tokens
Output Price / 1M
$2.30
Completion tokens
Top Benchmark
82.4
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for MoonshotAI: Kimi K2 0711. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.5700
Current Output / 1M
$2.30
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.5700 |
| Output (Completion) | $2.30 |
Estimate monthly spend for MoonshotAI: Kimi K2 0711 based on your workload.
Estimated Monthly Cost
$42
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.