Context Window
131,072
Tokens
Model Cost Profile
Developer: nousresearch
Pricing updated Mar 11, 2026
Nous: Hermes 4 70B, developed by nousresearch, offers a substantial context window of 131,072 tokens, making it ideal for applications requiring extensive document analysis or multi-turn conversations. Teams leveraging this API model can expect an input cost of $0.13 per million tokens and an output cost of $0.40 per million tokens, which can significantly impact budget considerations for high-volume usage scenarios. This model is particularly suited for industries such as legal, healthcare, and customer support, where detailed context and nuanced understanding are essential.
Context Window
131,072
Tokens
Input Price / 1M
$0.1300
Prompt tokens
Output Price / 1M
$0.4000
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1300 |
| Output (Completion) | $0.4000 |
Price History
Current Input / 1M
$0.1300
Current Output / 1M
$0.4000
Estimate monthly spend for Nous: Hermes 4 70B based on your workload.
Estimated Monthly Cost
$8.05
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Nous: Hermes 4 70B.
Nous: Hermes 4 70B input pricing is $0.1300 per 1M tokens based on the latest synced provider data.
Nous: Hermes 4 70B output pricing is $0.4000 per 1M tokens based on the latest synced provider data.
Nous: Hermes 4 70B supports a context window of 131,072 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.