Context Window
40,960
Input tokens
Full-context input ≈ $0.00
Model Cost Profile
Developer: qwen· Tokenizer: Qwen3 · Instruct: qwen3 · Quantization: int4
Canonical ID: qwen/qwen3-14b-04-28
Pricing updated Apr 23, 2026
Live Pricing
Input: $0.0600
Output: $0.2400
Last synced Apr 23, 2026 · MMLU score via public benchmark data
Qwen3 14B, developed by Qwen, offers a substantial context window of 40,960 tokens, making it suitable for applications requiring extensive text comprehension and generation, such as document summarization and conversational AI. Teams utilizing this API model can expect an input cost of $0.06 per million tokens and an output cost of $0.24 per million tokens, which allows for budget forecasting based on usage patterns. This pricing structure supports various use cases, from large-scale content creation to interactive chatbots, enabling teams to optimize their spending according to their specific project needs.
Context Window
40,960
Input tokens
Full-context input ≈ $0.00
Max Output
40,960
Completion tokens
Input Price / 1M
$0.0600
Prompt tokens
Output Price / 1M
$0.2400
Completion tokens
Top Benchmark
67.5
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Qwen: Qwen3 14B. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.0600
Current Output / 1M
$0.2400
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0600 |
| Output (Completion) | $0.2400 |
Estimate monthly spend for Qwen: Qwen3 14B based on your workload.
Estimated Monthly Cost
$4.38
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.