Model Cost Profile

Qwen: Qwen3 30B A3B Instruct 2507

Developer: qwen· Tokenizer: Qwen3 · Quantization: fp8

Canonical ID: qwen/qwen3-30b-a3b-instruct-2507

Pricing updated Apr 24, 2026

Input rank: #76Output rank: #89

Live Pricing

Input: $0.0900

Output: $0.3000

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

Qwen3 30B A3B Instruct 2507, developed by Qwen, offers a substantial context window of 262,144 tokens, making it ideal for applications requiring extensive data processing such as document summarization and complex dialogue systems. With an input cost of $0.09 per million tokens and an output cost of $0.30 per million tokens, teams can effectively manage their budgets while leveraging the model's advanced capabilities for tasks like content generation and interactive AI solutions. This pricing structure allows organizations to scale their usage based on specific project needs, ensuring cost-effectiveness in high-demand scenarios.

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output

Context Window

262,144

Input tokens

Full-context input ≈ $0.02

Max Output

262,144

Completion tokens

Input Price / 1M

$0.0900

Prompt tokens

Output Price / 1M

$0.3000

Completion tokens

Top Benchmark

80.5

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 30B A3B Instruct 2507. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	70.7	#47 of 125	artificial_analysis
MMLU	80.5	#42 of 127	artificial_analysis

Price History

Qwen: Qwen3 30B A3B Instruct 2507 Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.0900

Current Output / 1M

$0.3000

Performance History

Qwen: Qwen3 30B A3B Instruct 2507 Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

91.4%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0900
Output (Completion)	$0.3000

Compare with Qwen: Qwen3 Next 80B A3B Instruct Compare with NVIDIA: Nemotron 3 Super Compare with Tongyi DeepResearch 30B A3B

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 30B A3B Instruct 2507 based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$5.85

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$5.85 Free Models Router$0.00−$5.85 Google: Gemma 3 12B (free)$0.00−$5.85 Google: Gemma 3 27B (free)$0.00−$5.85

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 30B A3B Instruct 2507 vs Baidu: Qianfan-OCR-Fast (free)Qwen: Qwen3 30B A3B Instruct 2507 vs Free Models Router Qwen: Qwen3 30B A3B Instruct 2507 vs Google: Gemma 3 12B (free)Qwen: Qwen3 30B A3B Instruct 2507 vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

70.7

#47 of 125

artificial_analysis

MMLU

80.5

#42 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0900

Output (Completion)

$0.3000

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 30B A3B Instruct 2507 based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$5.85

25M input + 12M output tokens