Context Window
131,072
Tokens
Model Cost Profile
Developer: qwen
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.0800
Output: $0.5000
Pricing via OpenRouter API ยท Last synced Mar 11, 2026 ยท MMLU score via public benchmark data
Qwen3 VL 8B Instruct, developed by qwen, is designed for applications requiring extensive context, accommodating up to 131,072 tokens for complex tasks such as document summarization and conversational agents. Teams utilizing this API model can expect an input cost of $0.08 per million tokens and an output cost of $0.50 per million tokens, making it suitable for projects with significant data processing needs. Its advanced instruction-following capabilities enable efficient handling of diverse use cases, from natural language understanding to content generation.
Context Window
131,072
Tokens
Input Price / 1M
$0.0800
Prompt tokens
Output Price / 1M
$0.5000
Completion tokens
Intelligence (MMLU)
68.6
Massive Multitask Language Understanding
Standardized evaluation scores for Qwen: Qwen3 VL 8B Instruct.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0800 |
| Output (Completion) | $0.5000 |
Price History
Current Input / 1M
$0.0800
Current Output / 1M
$0.5000
Estimate monthly spend for Qwen: Qwen3 VL 8B Instruct based on your workload.
Estimated Monthly Cost
$8.00
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Qwen: Qwen3 VL 8B Instruct.
Qwen: Qwen3 VL 8B Instruct input pricing is $0.0800 per 1M tokens based on the latest synced provider data.
Qwen: Qwen3 VL 8B Instruct output pricing is $0.5000 per 1M tokens based on the latest synced provider data.
Qwen: Qwen3 VL 8B Instruct supports a context window of 131,072 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.