Context Window
131,072
Input tokens
Full-context input โ $0.13
Model Cost Profile
Developer: nousresearchยท Tokenizer: Other ยท Quantization: fp8
Canonical ID: nousresearch/hermes-4-405b
Pricing updated Apr 22, 2026
Live Pricing
Input: $1.00
Output: $3.00
Last synced Apr 22, 2026
Nous: Hermes 4 405B, developed by nousresearch, offers a substantial context window of 131072 tokens, making it ideal for applications requiring in-depth analysis of large datasets or extensive document processing. With an input price of $1.00 per million tokens and an output price of $3.00 per million tokens, teams can effectively budget for projects involving complex queries or content generation at scale. This model is particularly suited for industries such as legal, academic, and research where comprehensive context and detailed responses are critical for decision-making.
Context Window
131,072
Input tokens
Full-context input โ $0.13
Max Output
โ
Not specified
Input Price / 1M
$1.00
Prompt tokens
Output Price / 1M
$3.00
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$1.00
Current Output / 1M
$3.00
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $1.00 |
| Output (Completion) | $3.00 |
Estimate monthly spend for Nous: Hermes 4 405B based on your workload.
Estimated Monthly Cost
$61
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.