Context Window: 32,768 tokens
Model Cost Profile
Developer: Qwen
Pricing updated Mar 11, 2026
Qwen2.5 72B Instruct is an instruction-tuned language model suited to applications such as chatbots, content generation, and data analysis. Its context window of 32,768 tokens lets it keep long dialogues and multi-part queries in context across an interaction. API pricing is $0.12 per million input tokens and $0.39 per million output tokens.
Input Price / 1M: $0.1200 (prompt tokens)
Output Price / 1M: $0.3900 (completion tokens)
Intelligence (MMLU, Massive Multitask Language Understanding): Benchmark Pending
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1200 |
| Output (Completion) | $0.3900 |
Price History
Current Input / 1M: $0.1200
Current Output / 1M: $0.3900
Estimate monthly spend for Qwen2.5 72B Instruct based on your workload.
Estimated Monthly Cost: $7.68 (25M input + 12M output tokens)
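The estimate above follows directly from the per-million prices listed on this page. As a minimal sketch (the function name `estimate_monthly_cost` and the constant names are illustrative assumptions, not part of any provider API), the same arithmetic can be reproduced for other workloads:

```python
from decimal import Decimal

# Prices listed on this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.12")
OUTPUT_PRICE_PER_M = Decimal("0.39")


def estimate_monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated monthly spend in USD, rounded to cents.

    Hypothetical helper: it only multiplies token volumes by the listed
    per-million prices and ignores any provider-specific fees or discounts.
    """
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) / million * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / million * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost).quantize(Decimal("0.01"))


# Workload from the example: 25M input tokens and 12M output tokens.
# 25 * 0.12 + 12 * 0.39 = 3.00 + 4.68 = 7.68
cost = estimate_monthly_cost(25_000_000, 12_000_000)
print(f"${cost}")  # prints "$7.68"
```

`Decimal` is used here instead of floats so that the result rounds cleanly to cents; swap in your own monthly token volumes to project spend for a different workload.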
Common pricing and benchmark questions for Qwen2.5 72B Instruct.
Qwen2.5 72B Instruct input pricing is $0.1200 per 1M tokens based on the latest synced provider data.
Qwen2.5 72B Instruct output pricing is $0.3900 per 1M tokens based on the latest synced provider data.
Qwen2.5 72B Instruct supports a context window of 32,768 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.