Model Cost Profile

Qwen: Qwen3 30B A3B

Developer: qwen· Tokenizer: Qwen3 · Instruct: qwen3 · Quantization: fp8

Canonical ID: qwen/qwen3-30b-a3b-04-28

Pricing updated Apr 24, 2026

Input rank: #71Output rank: #78

Live Pricing

Input: $0.0800

Output: $0.2800

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

Qwen3 30B A3B, developed by Qwen, features a substantial context window of 40,960 tokens, making it suitable for complex tasks such as document summarization and conversational AI applications. With an input price of $0.08 per million tokens and an output price of $0.28 per million tokens, teams can effectively manage costs while leveraging the model for extensive data processing and analysis. This pricing structure allows organizations to scale their usage based on project needs, optimizing budget allocation for large-scale AI implementations.

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Context Window

40,960

Input tokens

Full-context input ≈ $0.00

Max Output

40,960

Completion tokens

Input Price / 1M

$0.0800

Prompt tokens

Output Price / 1M

$0.2800

Completion tokens

Top Benchmark

79.2

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 30B A3B. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	72.6	#44 of 125	artificial_analysis
MMLU	79.2	#51 of 127	artificial_analysis

Price History

Qwen: Qwen3 30B A3B Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.0800

Current Output / 1M

$0.2800

Performance History

Qwen: Qwen3 30B A3B Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

99.9%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0800
Output (Completion)	$0.2800

Compare with Qwen: Qwen3 30B A3B Thinking 2507 Compare with Google: Gemma 3 27B Compare with Meta: Llama 4 Scout

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 30B A3B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$5.36

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$5.36 Free Models Router$0.00−$5.36 Google: Gemma 3 12B (free)$0.00−$5.36 Google: Gemma 3 27B (free)$0.00−$5.36

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 30B A3B vs Baidu: Qianfan-OCR-Fast (free)Qwen: Qwen3 30B A3B vs Free Models Router Qwen: Qwen3 30B A3B vs Google: Gemma 3 12B (free)Qwen: Qwen3 30B A3B vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

72.6

#44 of 125

artificial_analysis

MMLU

79.2

#51 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0800

Output (Completion)

$0.2800

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 30B A3B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$5.36

25M input + 12M output tokens