Context Window
16,384
Input tokens
Full-context input ≈ $0.0011
Model Cost Profile
Developer: Microsoft · Tokenizer: Other · Quantization: int4
Canonical ID: microsoft/phi-4
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.0650 / 1M tokens
Output: $0.1400 / 1M tokens
Last synced Apr 24, 2026
Microsoft Phi 4 has a 16,384-token context window, enough for document summarization and multi-turn conversational AI on moderately sized inputs. At $0.065 per million input tokens and $0.14 per million output tokens, cost scales linearly with usage, so teams can budget directly from expected token volume. The model suits workloads such as legal document analysis or customer support automation, provided each request fits within the 16,384-token window.
Max Output
16,384
Completion tokens
Input Price / 1M
$0.0650
Prompt tokens
Output Price / 1M
$0.1400
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.0650
Current Output / 1M
$0.1400
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
0.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0650 |
| Output (Completion) | $0.1400 |
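The rates in the table can be turned into a per-request cost with simple arithmetic. The sketch below is illustrative only: the function name `request_cost` and the constant names are assumptions, not part of any provider SDK, and the prices are copied from the table as listed on this page.

```python
# Prices in USD per 1M tokens, taken from the pricing table.
INPUT_PRICE_PER_M = 0.0650
OUTPUT_PRICE_PER_M = 0.1400
CONTEXT_WINDOW = 16_384  # tokens

def request_cost(input_tokens: int, output_tokens: int = 0) -> float:
    """Return the USD cost of one request (hypothetical helper)."""
    return (
        input_tokens * INPUT_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000

# Cost of filling the whole context window with prompt tokens.
full_context = request_cost(CONTEXT_WINDOW)
print(f"{full_context:.6f}")  # 0.001065
```

A full-context prompt therefore costs roughly a tenth of a cent, which is why a display rounded to two decimals shows it as $0.00.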
Estimate monthly spend for Microsoft: Phi 4 based on your workload.
Estimated Monthly Cost
$3.31
25M input + 12M output tokens
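The estimate above can be reproduced from the listed prices. The following is a minimal sketch, assuming the calculator multiplies each token volume by its per-million price and rounds half up to the nearest cent; the helper name `monthly_cost` is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP

def monthly_cost(
    input_millions,
    output_millions,
    input_price=Decimal("0.0650"),   # USD per 1M prompt tokens
    output_price=Decimal("0.1400"),  # USD per 1M completion tokens
):
    """Estimate monthly spend in USD, rounded to cents (assumed half-up)."""
    total = (
        Decimal(str(input_millions)) * input_price
        + Decimal(str(output_millions)) * output_price
    )
    return total.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)

# 25M input + 12M output tokens: 1.625 + 1.680 = 3.305
estimate = monthly_cost(25, 12)
print(estimate)  # 3.31
```

`Decimal` is used instead of floats because the unrounded total, 3.305, sits exactly on a rounding boundary, where binary floating point could round either way.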