Context Window
8,192
Tokens
Model Cost Profile
Developer: nousresearch
Pricing updated Mar 11, 2026
NousResearch's Hermes 2 Pro - Llama-3 8B has a context window of 8,192 tokens, which covers conversational use and summarization of documents that fit within that limit. Input and output tokens are both priced at $0.14 per million, so cost scales linearly with total token volume regardless of the prompt-to-completion ratio. The flat per-token rate suits teams that want to add language features to a product while keeping spend predictable.
Input Price / 1M
$0.1400
Prompt tokens
Output Price / 1M
$0.1400
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1400 |
| Output (Completion) | $0.1400 |
Price History
Current Input / 1M
$0.1400
Current Output / 1M
$0.1400
Estimate monthly spend for NousResearch: Hermes 2 Pro - Llama-3 8B based on your workload.
Estimated Monthly Cost
$5.18
25M input + 12M output tokens
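The estimate above follows directly from the per-million prices in the table. A minimal sketch of the arithmetic, assuming the listed $0.14 rates and no other fees, discounts, or rounding rules (the function and constant names are illustrative, not part of any provider API):

```python
from decimal import Decimal

# Prices from the table on this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.14")
OUTPUT_PRICE_PER_M = Decimal("0.14")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated monthly cost in USD, rounded to cents.

    Hypothetical helper: assumes cost is purely linear in token counts.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost).quantize(Decimal("0.01"))


# 25M input tokens cost $3.50 and 12M output tokens cost $1.68.
print(estimate_monthly_cost(25_000_000, 12_000_000))  # 5.18
```

Because input and output share the same rate here, only the total token count matters; the two prices are kept separate so the sketch still works if they diverge.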
Common pricing and benchmark questions for NousResearch: Hermes 2 Pro - Llama-3 8B.
NousResearch: Hermes 2 Pro - Llama-3 8B input pricing is $0.1400 per 1M tokens based on the latest synced provider data.
NousResearch: Hermes 2 Pro - Llama-3 8B output pricing is $0.1400 per 1M tokens based on the latest synced provider data.
NousResearch: Hermes 2 Pro - Llama-3 8B supports a context window of 8,192 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.
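When sizing a workload against the 8,192-token context window, it can help to check how much room a prompt leaves for the completion. A minimal sketch, assuming prompt and completion tokens share the same window and that token counts are already known from a tokenizer (the helper names are hypothetical):

```python
# Context window listed on this page, in tokens.
CONTEXT_WINDOW = 8_192


def max_completion_tokens(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many completion tokens remain after the prompt.

    Assumption: prompt and completion draw from one shared window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(context_window - prompt_tokens, 0)


def fits_in_context(prompt_tokens: int, completion_tokens: int,
                    context_window: int = CONTEXT_WINDOW) -> bool:
    """Return True if the prompt plus the requested completion fit in the window."""
    return completion_tokens <= max_completion_tokens(prompt_tokens, context_window)


print(max_completion_tokens(6_000))   # 2192
print(fits_in_context(6_000, 2_500))  # False
```

Requests that fail this check need a shorter prompt, a smaller completion limit, or a model with a larger window.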