Context Window
65,535
Input tokens
Full-context input ≈ $0.04
Model Cost Profile
Developer: microsoft· Tokenizer: Mistral · Instruct: vicuna · Quantization: bf16
Canonical ID: microsoft/wizardlm-2-8x22b
Pricing updated Apr 22, 2026
Live Pricing
Input: $0.6200
Output: $0.6200
Last synced Apr 22, 2026
WizardLM-2 8x22B, developed by Microsoft, offers a substantial context window of 65,535 tokens, making it ideal for complex applications such as document analysis and conversational AI where extensive context is crucial. Teams utilizing this API model can expect an input and output pricing of $0.62 per 1 million tokens, allowing for cost-effective scaling in projects requiring large data processing. This model is particularly suited for enterprises focused on natural language understanding tasks that demand high context retention and operational efficiency.
Context Window
65,535
Input tokens
Full-context input ≈ $0.04
Max Output
8,000
Completion tokens
Input Price / 1M
$0.6200
Prompt tokens
Output Price / 1M
$0.6200
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.6200
Current Output / 1M
$0.6200
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.6200 |
| Output (Completion) | $0.6200 |
Estimate monthly spend for WizardLM-2 8x22B based on your workload.
Estimated Monthly Cost
$23
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.