Model Cost Profile

Qwen: Qwen3 VL 30B A3B Instruct

Developer: qwen· Tokenizer: Qwen3 · Quantization: unknown

Canonical ID: qwen/qwen3-vl-30b-a3b-instruct

Pricing updated Apr 24, 2026

Input rank: #104Output rank: #125

Live Pricing

Input: $0.1300

Output: $0.5200

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

Qwen3 VL 30B A3B Instruct, developed by Qwen, offers a substantial context window of 131,072 tokens, making it suitable for applications requiring extensive text analysis or generation, such as legal document review or long-form content creation. With an input price of $0.13 per million tokens and an output price of $0.52 per million tokens, teams can effectively manage costs while leveraging the model's capabilities for complex tasks. This model is particularly advantageous for organizations needing to process large datasets or engage in detailed conversational AI applications, ensuring both efficiency and scalability.

👁 Vision🔧 Tool Calling🔌 MCP Compatible📋 Structured Output

Context Window

131,072

Input tokens

Full-context input ≈ $0.02

Max Output

32,768

Completion tokens

Input Price / 1M

$0.1300

Prompt tokens

Output Price / 1M

$0.5200

Completion tokens

Top Benchmark

72.5

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 VL 30B A3B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	62.0	#62 of 125	artificial_analysis
MMLU	72.5	#83 of 127	artificial_analysis

Price History

Qwen: Qwen3 VL 30B A3B Instruct Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.1300

Current Output / 1M

$0.5200

Performance History

Qwen: Qwen3 VL 30B A3B Instruct Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.1300
Output (Completion)	$0.5200

Compare with Qwen: Qwen3 VL 30B A3B Thinking Compare with Google: Gemma 4 31B Compare with Nous: Hermes 4 70B

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 VL 30B A3B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$9.49

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$9.49 Free Models Router$0.00−$9.49 Google: Gemma 3 12B (free)$0.00−$9.49 Google: Gemma 3 27B (free)$0.00−$9.49

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 VL 30B A3B Instruct vs Baidu: Qianfan-OCR-Fast (free)Qwen: Qwen3 VL 30B A3B Instruct vs Free Models Router Qwen: Qwen3 VL 30B A3B Instruct vs Google: Gemma 3 12B (free)Qwen: Qwen3 VL 30B A3B Instruct vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

62.0

#62 of 125

artificial_analysis

MMLU

72.5

#83 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.1300

Output (Completion)

$0.5200

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 VL 30B A3B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$9.49

25M input + 12M output tokens