Context Window
262,144
Input tokens
Full-context input β $0.02
Model Cost Profile
Developer: inclusionaiΒ· Tokenizer: Other
Canonical ID: inclusionai/ling-2.6-flash-20260421
Pricing updated Apr 29, 2026
Live Pricing
Input: $0.0800
Output: $0.2400
Last synced Apr 29, 2026 Β· MMLU score via public benchmark data
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
Context Window
262,144
Input tokens
Full-context input β $0.02
Max Output
32,768
Completion tokens
Input Price / 1M
$0.0800
Prompt tokens
Output Price / 1M
$0.2400
Completion tokens
Top Benchmark
77.7
MMLU score β highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for inclusionAI: Ling-2.6-flash. The βTop Benchmarkβ shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Not enough data yet. Price tracking started recently β check back in a few days.
Performance History
Not enough data yet. Performance tracking started recently β check back in a few days.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0800 |
| Output (Completion) | $0.2400 |
| Cache Read | $0.0160 |
Estimate monthly spend for inclusionAI: Ling-2.6-flash based on your workload.
Estimated Monthly Cost
$4.88
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.