Model Cost Profile

Nous: Hermes 4 70B

Developer: nousresearch· Tokenizer: Llama3 · Quantization: fp8

Canonical ID: nousresearch/hermes-4-70b

Pricing updated Apr 22, 2026

Input rank: #101Output rank: #100

Live Pricing

Input: $0.1300

Output: $0.4000

HuggingFace ↗View full pricing leaderboard

Last synced Apr 22, 2026

Nous: Hermes 4 70B, developed by nousresearch, offers a substantial context window of 131,072 tokens, making it ideal for applications requiring extensive document analysis or multi-turn conversations. Teams leveraging this API model can expect an input cost of $0.13 per million tokens and an output cost of $0.40 per million tokens, which can significantly impact budget considerations for high-volume usage scenarios. This model is particularly suited for industries such as legal, healthcare, and customer support, where detailed context and nuanced understanding are essential.

📋 Structured Output🧠 Reasoning

Context Window

131,072

Input tokens

Full-context input ≈ $0.02

Max Output

—

Not specified

Input Price / 1M

$0.1300

Prompt tokens

Output Price / 1M

$0.4000

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

Nous: Hermes 4 70B Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.1300

Current Output / 1M

$0.4000

Performance History

Nous: Hermes 4 70B Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.1300
Output (Completion)	$0.4000

Compare with NousResearch: Hermes 2 Pro - Llama-3 8B Compare with Google: Gemma 4 31B Compare with Qwen: Qwen3 235B A22B Thinking 2507

Cost Calculator

Estimate monthly spend for Nous: Hermes 4 70B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$8.05

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00−$8.05 Free Models Router$0.00−$8.05 Google: Gemma 3 12B (free)$0.00−$8.05 Google: Gemma 3 27B (free)$0.00−$8.05

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Nous: Hermes 4 70B vs Arcee AI: Trinity Large Preview (free)Nous: Hermes 4 70B vs Free Models Router Nous: Hermes 4 70B vs Google: Gemma 3 12B (free)Nous: Hermes 4 70B vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.1300

Output (Completion)

$0.4000