Model Cost Profile

Nous: Hermes 3 405B Instruct

Developer: nousresearch· Tokenizer: Llama3 · Instruct: chatml · Quantization: fp8

Canonical ID: nousresearch/hermes-3-llama-3.1-405b

Pricing updated Apr 23, 2026

Input rank: #247Output rank: #169

Live Pricing

Input: $1.00

Output: $1.00

HuggingFace ↗View full pricing leaderboard

Last synced Apr 23, 2026 · MMLU score via public benchmark data

Nous: Hermes 3 405B Instruct, developed by nousresearch, offers a substantial context window of 131072 tokens, making it ideal for applications requiring extensive data processing and nuanced understanding, such as legal document analysis or comprehensive content generation. With an input and output pricing model set at $1.00 per million tokens, teams can effectively budget for high-volume tasks while maintaining cost efficiency in their API usage. This model is particularly advantageous for organizations that need to handle large datasets or complex queries without incurring prohibitive costs.

📋 Structured Output

Context Window

131,072

Input tokens

Full-context input ≈ $0.13

Max Output

16,384

Completion tokens

Input Price / 1M

$1.00

Prompt tokens

Output Price / 1M

$1.00

Completion tokens

Top Benchmark

82.9

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Nous: Hermes 3 405B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	72.7	#44 of 125	artificial_analysis
MMLU	82.9	#24 of 127	artificial_analysis

Price History

Nous: Hermes 3 405B Instruct Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$1.00

Current Output / 1M

$1.00

Performance History

Nous: Hermes 3 405B Instruct Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$1.00
Output (Completion)	$1.00

Compare with Nous: Hermes 4 405B Compare with Anthropic: Claude Haiku 4.5 Compare with OpenAI: GPT-3.5 Turbo (older v0613)

Cost Calculator

Estimate monthly spend for Nous: Hermes 3 405B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$37

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00−$37 Free Models Router$0.00−$37 Google: Gemma 3 12B (free)$0.00−$37 Google: Gemma 3 27B (free)$0.00−$37

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Nous: Hermes 3 405B Instruct vs Arcee AI: Trinity Large Preview (free)Nous: Hermes 3 405B Instruct vs Free Models Router Nous: Hermes 3 405B Instruct vs Google: Gemma 3 12B (free)Nous: Hermes 3 405B Instruct vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

72.7

#44 of 125

artificial_analysis

MMLU

82.9

#24 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$1.00

Output (Completion)

$1.00

Cost Calculator

Estimate monthly spend for Nous: Hermes 3 405B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$37

25M input + 12M output tokens