Context Window
32,768
Tokens
Model Cost Profile
Developer: qwen
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.8000
Output: $0.8000
Pricing via OpenRouter API ยท Last synced Mar 11, 2026 ยท MMLU score via public benchmark data
Qwen2.5 VL 72B Instruct, developed by qwen, is designed for applications requiring extensive context handling, supporting a context window of 32,768 tokens. This model is particularly suited for complex tasks such as document summarization, conversational agents, and multi-turn dialogue systems, making it ideal for teams that need to process large volumes of text data efficiently. With a competitive pricing structure of $0.80 per million tokens for both input and output, organizations can effectively manage their budget while leveraging advanced AI capabilities.
Context Window
32,768
Tokens
Input Price / 1M
$0.8000
Prompt tokens
Output Price / 1M
$0.8000
Completion tokens
Intelligence (MMLU)
72.0
Massive Multitask Language Understanding
Standardized evaluation scores for Qwen: Qwen2.5 VL 72B Instruct.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.8000 |
| Output (Completion) | $0.8000 |
Price History
Current Input / 1M
$0.8000
Current Output / 1M
$0.8000
Estimate monthly spend for Qwen: Qwen2.5 VL 72B Instruct based on your workload.
Estimated Monthly Cost
$30
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Qwen: Qwen2.5 VL 72B Instruct.
Qwen: Qwen2.5 VL 72B Instruct input pricing is $0.8000 per 1M tokens based on the latest synced provider data.
Qwen: Qwen2.5 VL 72B Instruct output pricing is $0.8000 per 1M tokens based on the latest synced provider data.
Qwen: Qwen2.5 VL 72B Instruct supports a context window of 32,768 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.