Model Cost Profile

inclusionAI: Ling-2.6-flash🆕 New

Developer: inclusionai· Tokenizer: Other

Canonical ID: inclusionai/ling-2.6-flash-20260421

Pricing updated Apr 29, 2026

Input rank: #73Output rank: #74

Live Pricing

Input: $0.0800

Output: $0.2400

View full pricing leaderboard

Last synced Apr 29, 2026 · MMLU score via public benchmark data

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

💡 Enable prompt caching to save 80% on repeated input tokens ($0.0160/M cached vs $0.0800/M standard).

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output

Context Window

262,144

Input tokens

Full-context input ≈ $0.02

Max Output

32,768

Completion tokens

Input Price / 1M

$0.0800

Prompt tokens

Output Price / 1M

$0.2400

Completion tokens

Top Benchmark

77.7

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for inclusionAI: Ling-2.6-flash. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	65.7	#63 of 127	artificial_analysis
MMLU	77.7	#62 of 129	artificial_analysis

Price History

inclusionAI: Ling-2.6-flash Pricing Trend

Not enough data yet. Price tracking started recently — check back in a few days.

Performance History

inclusionAI: Ling-2.6-flash Speed Trend

Not enough data yet. Performance tracking started recently — check back in a few days.

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0800
Output (Completion)	$0.2400
Cache Read	$0.0160

Compare with inclusionAI: Ling-2.6-1T (free)Compare with Google: Gemma 3 27B Compare with Meta: Llama 4 Scout

Cost Calculator

Estimate monthly spend for inclusionAI: Ling-2.6-flash based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$4.88

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$4.88 Free Models Router$0.00−$4.88 Google: Gemma 3 12B (free)$0.00−$4.88 Google: Gemma 3 27B (free)$0.00−$4.88

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

inclusionAI: Ling-2.6-flash vs Baidu: Qianfan-OCR-Fast (free)inclusionAI: Ling-2.6-flash vs Free Models Router inclusionAI: Ling-2.6-flash vs Google: Gemma 3 12B (free)inclusionAI: Ling-2.6-flash vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

65.7

#63 of 127

artificial_analysis

MMLU

77.7

#62 of 129

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)