Model Cost Profile

Qwen: Qwen2.5 VL 72B Instruct

Developer: qwen · Tokenizer: Qwen · Quantization: fp8

Canonical ID: qwen/qwen2.5-vl-72b-instruct

Pricing updated Apr 24, 2026

Input rank: #157 · Output rank: #147

Live Pricing

Input: $0.2500

Output: $0.7500

Last synced Apr 24, 2026 · MMLU score via public benchmark data

Qwen2.5 VL 72B Instruct, developed by Qwen, is a vision-language model with a context window of 32,768 tokens. It suits tasks such as document summarization, conversational agents, and multi-turn dialogue, for teams that process large volumes of text and image data. It is priced at $0.25 per million input tokens and $0.75 per million output tokens.

👁 Vision · 📋 Structured Output

Context Window

32,000

Input tokens

Full-context input ≈ $0.01

Max Output

—

Not specified

Input Price / 1M

$0.2500

Prompt tokens

Output Price / 1M

$0.7500

Completion tokens

Top Benchmark

72.0

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen2.5 VL 72B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	49.1	#95 of 125	artificial_analysis
MMLU	72.0	#85 of 127	artificial_analysis

Price History

Qwen: Qwen2.5 VL 72B Instruct Pricing Trend

Input / 1M tokens: -68.8% · Output / 1M tokens: -6.2%

Current Input / 1M

$0.2500

Current Output / 1M

$0.7500
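The trend percentages and current prices above are enough to back out an implied starting price. The sketch below assumes each percentage is measured against the first price in the trend window, which the page does not state; `implied_prior_price` is a hypothetical helper name.

```python
def implied_prior_price(current: float, pct_change: float) -> float:
    """Back out a starting price from the current price and a signed percent change.

    pct_change is signed: -68.8 means a 68.8% decrease.
    Assumption: the change is relative to the starting price.
    """
    factor = 1 + pct_change / 100
    if factor <= 0:
        raise ValueError("percent change must be greater than -100")
    return current / factor


# Current prices and trend figures as listed on this page (USD per 1M tokens).
prior_input = implied_prior_price(0.25, -68.8)
prior_output = implied_prior_price(0.75, -6.2)
print(f"input: ${prior_input:.2f}, output: ${prior_output:.2f}")
```

Under that assumption, both figures come out at roughly $0.80 per million tokens.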

Performance History

Qwen: Qwen2.5 VL 72B Instruct Speed Trend

Tokens/sec (higher is better) · Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.2500
Output (Completion)	$0.7500
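As a rough illustration of how the per-million rates translate to a single request, the sketch below computes the cost of one call from its token counts. The function and constant names are illustrative, not part of any provider API, and the rates are the ones listed in the table above.

```python
# Listed rates for this model, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.25
OUTPUT_PRICE_PER_M = 0.75


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request given prompt and completion token counts."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# A prompt that fills the 32,000-token window shown on this page, with no output.
full_context = request_cost(32_000, 0)
print(f"${full_context:.3f}")
```

A full-context prompt costs about $0.008, which the page rounds to $0.01.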

Cost Calculator

Estimate monthly spend for Qwen: Qwen2.5 VL 72B Instruct based on your workload.

Input tokens / month

0 to 1.0B

Output tokens / month

0 to 500M

Estimated Monthly Cost

$15

25M input + 12M output tokens
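The estimate above can be reproduced by hand. This is a minimal sketch of the arithmetic, assuming the calculator simply multiplies token volume by the listed per-million rates and rounds to the nearest dollar; `monthly_cost` is a hypothetical name.

```python
def monthly_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float = 0.25,
                 output_price_per_m: float = 0.75) -> dict:
    """Break a monthly token workload into input, output, and total USD cost."""
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return {
        "input": input_cost,
        "output": output_cost,
        "total": input_cost + output_cost,
    }


# The workload shown on this page: 25M input + 12M output tokens per month.
estimate = monthly_cost(25_000_000, 12_000_000)
print(estimate)
```

For this workload the input side comes to $6.25 and the output side to $9.00, for a total of $15.25, consistent with the $15 shown once rounded.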

Same Workload on Other Models

Model	Monthly Cost	Difference
Baidu: Qianfan-OCR-Fast (free)	$0.00	−$15
Free Models Router	$0.00	−$15
Google: Gemma 3 12B (free)	$0.00	−$15
Google: Gemma 3 27B (free)	$0.00	−$15
