Model Cost Profile

WizardLM-2 8x22B

Developer: microsoft· Tokenizer: Mistral · Instruct: vicuna · Quantization: bf16

Canonical ID: microsoft/wizardlm-2-8x22b

Pricing updated Apr 22, 2026

Input rank: #219Output rank: #139

Live Pricing

Input: $0.6200

Output: $0.6200

Visit Microsoft ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 22, 2026

WizardLM-2 8x22B, developed by Microsoft, offers a substantial context window of 65,535 tokens, making it ideal for complex applications such as document analysis and conversational AI where extensive context is crucial. Teams utilizing this API model can expect an input and output pricing of $0.62 per 1 million tokens, allowing for cost-effective scaling in projects requiring large data processing. This model is particularly suited for enterprises focused on natural language understanding tasks that demand high context retention and operational efficiency.

Context Window

65,535

Input tokens

Full-context input ≈ $0.04

Max Output

8,000

Completion tokens

Input Price / 1M

$0.6200

Prompt tokens

Output Price / 1M

$0.6200

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

WizardLM-2 8x22B Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.6200

Current Output / 1M

$0.6200

Performance History

WizardLM-2 8x22B Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.6200
Output (Completion)	$0.6200

Compare with Microsoft: Phi 4 Compare with MoonshotAI: Kimi K2 Thinking Compare with MoonshotAI: Kimi K2.6

Cost Calculator

Estimate monthly spend for WizardLM-2 8x22B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$23

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00−$23 Free Models Router$0.00−$23 Google: Gemma 3 12B (free)$0.00−$23 Google: Gemma 3 27B (free)$0.00−$23

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

WizardLM-2 8x22B vs Arcee AI: Trinity Large Preview (free)WizardLM-2 8x22B vs Free Models Router WizardLM-2 8x22B vs Google: Gemma 3 12B (free)WizardLM-2 8x22B vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.6200

Output (Completion)

$0.6200

Cost Calculator

Estimate monthly spend for WizardLM-2 8x22B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$23

25M input + 12M output tokens