Context Window: 131,072 input tokens (full-context input ≈ $0.00)
Model Cost Profile
Developer: nousresearch · Tokenizer: Llama3 · Instruct: chatml · Quantization: fp8
Canonical ID: nousresearch/hermes-3-llama-3.1-405b
Pricing updated Apr 25, 2026
Live Pricing
Input: $0.0000
Output: $0.0000
Last synced Apr 25, 2026
Nous: Hermes 3 405B Instruct is a free API model from nousresearch with a context window of 131,072 tokens. It suits natural language processing workloads such as chatbots, content generation, and long-document analysis. At the listed rate of $0.0000 per 1M tokens, neither input nor output incurs a charge, which makes the model a practical option for experimentation, startups, and research projects.
Max Output: not specified
Input Price / 1M: $0.0000 (prompt tokens)
Output Price / 1M: $0.0000 (completion tokens)
Top Benchmark: pending (no benchmark data yet)
Price History
Current Input / 1M: $0.000000
Current Output / 1M: $0.000000
Performance History
Current TPS: 0.00
Current Latency: 0 ms
Uptime: 90.1%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0000 |
| Output (Completion) | $0.0000 |
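The per-million rates in the table convert to a per-request cost by scaling each token count by its rate. The following is a minimal Python sketch, assuming a hypothetical helper named `request_cost` (it is not part of any provider SDK) and the rates and context window listed on this page:

```python
from decimal import Decimal

# Listed rates in USD per 1M tokens, copied from the pricing table.
INPUT_PRICE_PER_M = Decimal("0.0000")
OUTPUT_PRICE_PER_M = Decimal("0.0000")
CONTEXT_WINDOW = 131_072  # listed context window, in tokens

TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: Decimal = INPUT_PRICE_PER_M,
                 output_price: Decimal = OUTPUT_PRICE_PER_M) -> Decimal:
    """Hypothetical helper: USD cost of one request at per-1M-token rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * input_price
             + Decimal(output_tokens) * output_price)
    return total / TOKENS_PER_MILLION


# Cost of filling the entire context window with prompt tokens.
full_context_cost = request_cost(CONTEXT_WINDOW, 0)
print(f"Full-context input: ${full_context_cost:.2f}")
```

`Decimal` is used here rather than `float` so that very small per-token prices do not pick up rounding noise; that is a design choice of this sketch, not something the page prescribes.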
Estimate monthly spend for Nous: Hermes 3 405B Instruct (free) based on your workload.
Estimated Monthly Cost: $0.00 (25M input + 12M output tokens)
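The monthly figure follows from multiplying each monthly volume (in millions of tokens) by its per-1M rate and summing. A rough sketch of that arithmetic, assuming a hypothetical `monthly_cost` function and the 25M input + 12M output workload from the estimate:

```python
def monthly_cost(input_millions: float, output_millions: float,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Hypothetical estimator: monthly USD spend from volumes in millions of tokens."""
    if input_millions < 0 or output_millions < 0:
        raise ValueError("token volumes must be non-negative")
    return (input_millions * input_price_per_m
            + output_millions * output_price_per_m)


# Workload from the estimate: 25M input + 12M output tokens per month,
# priced at the listed rates of $0.0000 per 1M tokens.
estimate = monthly_cost(25, 12, input_price_per_m=0.0, output_price_per_m=0.0)
print(f"Estimated monthly cost: ${estimate:.2f}")
```

Substituting another model's rates for the two price arguments gives a like-for-like comparison on the same workload.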