Context Window
32,768
Tokens
Model Cost Profile
Developer: qwen
Pricing updated Mar 11, 2026
Qwen2.5 7B Instruct by qwen is designed for applications requiring extensive context, with a generous context window of 32,768 tokens, making it suitable for complex tasks such as document summarization and conversational AI. Teams leveraging this API model can expect an input cost of $0.04 per million tokens and an output cost of $0.10 per million tokens, which can significantly impact budget planning based on usage patterns. This model is particularly advantageous for organizations that need to process large volumes of text while maintaining high-quality outputs in real-time applications.
Context Window
32,768
Tokens
Input Price / 1M
$0.0400
Prompt tokens
Output Price / 1M
$0.1000
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0400 |
| Output (Completion) | $0.1000 |
Price History
Current Input / 1M
$0.0400
Current Output / 1M
$0.1000
Estimate monthly spend for Qwen: Qwen2.5 7B Instruct based on your workload.
Estimated Monthly Cost
$2.20
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Qwen: Qwen2.5 7B Instruct.
Qwen: Qwen2.5 7B Instruct input pricing is $0.0400 per 1M tokens based on the latest synced provider data.
Qwen: Qwen2.5 7B Instruct output pricing is $0.1000 per 1M tokens based on the latest synced provider data.
Qwen: Qwen2.5 7B Instruct supports a context window of 32,768 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.