Model Cost Profile

Qwen: Qwen3 14B

Developer: qwen· Tokenizer: Qwen3 · Instruct: qwen3 · Quantization: int4

Canonical ID: qwen/qwen3-14b-04-28

Pricing updated Apr 23, 2026

Input rank: #54Output rank: #69

Live Pricing

Input: $0.0600

Output: $0.2400

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 23, 2026 · MMLU score via public benchmark data

Qwen3 14B, developed by Qwen, offers a substantial context window of 40,960 tokens, making it suitable for applications requiring extensive text comprehension and generation, such as document summarization and conversational AI. Teams utilizing this API model can expect an input cost of $0.06 per million tokens and an output cost of $0.24 per million tokens, which allows for budget forecasting based on usage patterns. This pricing structure supports various use cases, from large-scale content creation to interactive chatbots, enabling teams to optimize their spending according to their specific project needs.

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Context Window

40,960

Input tokens

Full-context input ≈ $0.00

Max Output

40,960

Completion tokens

Input Price / 1M

$0.0600

Prompt tokens

Output Price / 1M

$0.2400

Completion tokens

Top Benchmark

67.5

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 14B. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	47.0	#105 of 125	artificial_analysis
MMLU	67.5	#106 of 127	artificial_analysis

Price History

Qwen: Qwen3 14B Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.0600

Current Output / 1M

$0.2400

Performance History

Qwen: Qwen3 14B Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0600
Output (Completion)	$0.2400

Compare with Qwen: Qwen3.5-Flash Compare with Amazon: Nova Lite 1.0 Compare with Google: Gemma 3n 4B

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 14B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$4.38

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00−$4.38 Free Models Router$0.00−$4.38 Google: Gemma 3 12B (free)$0.00−$4.38 Google: Gemma 3 27B (free)$0.00−$4.38

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 14B vs Arcee AI: Trinity Large Preview (free)Qwen: Qwen3 14B vs Free Models Router Qwen: Qwen3 14B vs Google: Gemma 3 12B (free)Qwen: Qwen3 14B vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

47.0

#105 of 125

artificial_analysis

MMLU

67.5

#106 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0600

Output (Completion)

$0.2400