Model Cost Profile

EleutherAI: Llemma 7b

Developer: eleutherai

Pricing updated Mar 11, 2026

Input rank: #232Output rank: #185

Live Pricing

Input: $0.8000

Output: $1.20

Pricing via OpenRouter API · Last synced Mar 11, 2026

EleutherAI's Llemma 7b is a versatile language model with a context window of 4096 tokens, making it suitable for applications such as chatbots, content generation, and data analysis. Teams leveraging this API model can expect an input price of $0.80 per 1 million tokens and an output price of $1.20 per 1 million tokens, which can impact budgeting depending on the volume of data processed. Its efficient architecture allows for scalable integration into various workflows, enhancing productivity while managing costs effectively.

Context Window

4,096

Tokens

Input Price / 1M

$0.8000

Prompt tokens

Output Price / 1M

$1.20

Completion tokens

Intelligence (MMLU)

Benchmark Pending

Massive Multitask Language Understanding

Price History

EleutherAI: Llemma 7b Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%
Mar 7Mar 11
$0.8000$1.00$1.20Mar 7Mar 8Mar 9Mar 10Mar 11

Current Input / 1M

$0.8000

Current Output / 1M

$1.20

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

FAQ

Common pricing and benchmark questions for EleutherAI: Llemma 7b.

How much does EleutherAI: Llemma 7b cost per 1M input tokens?

EleutherAI: Llemma 7b input pricing is $0.8000 per 1M tokens based on the latest synced provider data.

How much does EleutherAI: Llemma 7b cost per 1M output tokens?

EleutherAI: Llemma 7b output pricing is $1.20 per 1M tokens based on the latest synced provider data.

What context window does EleutherAI: Llemma 7b support?

EleutherAI: Llemma 7b supports a context window of 4,096 tokens.

How can I compare EleutherAI: Llemma 7b with cheaper alternatives?

Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.