Model Cost Profile

Qwen: Qwen3 VL 8B Thinking

Developer: qwen

Pricing updated Mar 11, 2026

Input rank: #100Output rank: #196

Live Pricing

Input: $0.1170

Output: $1.37

Pricing via OpenRouter API ยท Last synced Mar 11, 2026 ยท MMLU score via public benchmark data

Qwen: Qwen3 VL 8B Thinking is designed for applications requiring extensive context processing, offering a substantial context window of 131072 tokens that supports complex tasks such as document summarization and conversational AI. With an input price of $0.12 per million tokens and an output price of $1.36 per million tokens, teams can effectively budget for high-volume use cases while managing operational costs. This model is particularly suited for businesses needing scalable solutions for natural language understanding and generation, making it a cost-effective choice for data-intensive projects.

๐Ÿ‘ Vision๐Ÿ”ง Tool Calling๐Ÿ“‹ Structured Output๐Ÿง  Reasoning

Context Window

131,072

Tokens

Input Price / 1M

$0.1170

Prompt tokens

Output Price / 1M

$1.37

Completion tokens

Intelligence (MMLU)

74.9

Massive Multitask Language Understanding

Benchmark Scores

Standardized evaluation scores for Qwen: Qwen3 VL 8B Thinking.

BenchmarkScoreRankSource
GPQA57.9#65 of 118artificial_analysis
MMLU74.9#65 of 121artificial_analysis

Price History

Qwen: Qwen3 VL 8B Thinking Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%
Mar 7 โ€” Mar 11
$0.1170$0.7410$1.36Mar 7Mar 8Mar 9Mar 10Mar 11

Current Input / 1M

$0.1170

Current Output / 1M

$1.36

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

FAQ

Common pricing and benchmark questions for Qwen: Qwen3 VL 8B Thinking.

How much does Qwen: Qwen3 VL 8B Thinking cost per 1M input tokens?

Qwen: Qwen3 VL 8B Thinking input pricing is $0.1170 per 1M tokens based on the latest synced provider data.

How much does Qwen: Qwen3 VL 8B Thinking cost per 1M output tokens?

Qwen: Qwen3 VL 8B Thinking output pricing is $1.37 per 1M tokens based on the latest synced provider data.

What context window does Qwen: Qwen3 VL 8B Thinking support?

Qwen: Qwen3 VL 8B Thinking supports a context window of 131,072 tokens.

How can I compare Qwen: Qwen3 VL 8B Thinking with cheaper alternatives?

Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.