Context Window
131,072
Tokens
Model Cost Profile
Developer: Qwen
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.1170
Output: $1.37
Pricing via OpenRouter API · Last synced Mar 11, 2026 · MMLU score via public benchmark data
Qwen: Qwen3 VL 8B Thinking offers a context window of 131,072 tokens, which suits long-context tasks such as document summarization and multi-turn conversational AI. Input costs $0.1170 per million tokens and output costs $1.37 per million tokens, so output is roughly 12 times as expensive as input and completion length dominates the budget for high-volume use cases. The model is a reasonable fit for teams that need scalable natural language understanding and generation on data-intensive projects while keeping operational costs predictable.
Input Price / 1M
$0.1170
Prompt tokens
Output Price / 1M
$1.37
Completion tokens
Intelligence (MMLU)
74.9
Massive Multitask Language Understanding
Standardized evaluation scores for Qwen: Qwen3 VL 8B Thinking.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1170 |
| Output (Completion) | $1.37 |
Price History
Current Input / 1M
$0.1170
Current Output / 1M
$1.37
Estimate monthly spend for Qwen: Qwen3 VL 8B Thinking based on your workload.
Estimated Monthly Cost
$19
25M input + 12M output tokens
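The estimate above can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: `estimate_monthly_cost` is a hypothetical helper, not part of any pricing API, and it assumes the per-million prices listed on this page apply uniformly to every token, with no tiering, caching discounts, or provider fees.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the pricing table on this page.
INPUT_PRICE_PER_M = Decimal("0.1170")
OUTPUT_PRICE_PER_M = Decimal("1.37")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return estimated monthly spend in USD (hypothetical helper).

    Assumes flat per-token pricing for both prompt and completion tokens.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# The workload shown above: 25M input tokens and 12M output tokens.
cost = estimate_monthly_cost(25_000_000, 12_000_000)
print(f"${cost:.3f}")  # prints $19.365
```

Under these assumptions the workload comes to $19.365, which matches the $19 shown above once rounded to whole dollars. Output tokens account for $16.44 of that total, so trimming completion length is likely to reduce spend more than trimming prompts.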
Quick links for cost-reduction decisions before production rollout.
Common pricing and benchmark questions for Qwen: Qwen3 VL 8B Thinking.
Qwen: Qwen3 VL 8B Thinking input pricing is $0.1170 per 1M tokens based on the latest synced provider data.
Qwen: Qwen3 VL 8B Thinking output pricing is $1.37 per 1M tokens based on the latest synced provider data.
Qwen: Qwen3 VL 8B Thinking supports a context window of 131,072 tokens.
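As a rough illustration of what the 131,072-token limit means for a single request, the sketch below computes how many tokens remain for the completion once a prompt is accounted for. It assumes that prompt and completion tokens share the same window, which is common but not confirmed on this page, and that the caller already knows the prompt's token count. `remaining_completion_tokens` is a hypothetical helper name.

```python
CONTEXT_WINDOW = 131_072  # tokens, as listed on this page


def remaining_completion_tokens(prompt_tokens: int,
                                window: int = CONTEXT_WINDOW) -> int:
    """Return tokens left for the completion (hypothetical helper).

    Assumes prompt and completion share one window; returns 0 when the
    prompt alone already fills or exceeds it.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(window - prompt_tokens, 0)


print(remaining_completion_tokens(100_000))  # prints 31072
print(remaining_completion_tokens(150_000))  # prints 0
```

For a thinking model, any reasoning tokens the provider bills as output would also come out of this remainder, so it is worth leaving headroom rather than filling the window with the prompt.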
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.