Model Cost Profile

NousResearch: Hermes 2 Pro - Llama-3 8B

Developer: nousresearch · Tokenizer: Llama3 · Instruct: chatml · Quantization: fp16

Canonical ID: nousresearch/hermes-2-pro-llama-3-8b

Pricing updated Apr 24, 2026

Input rank: #110 · Output rank: #51

Live Pricing

Input: $0.1400 / 1M tokens

Output: $0.1400 / 1M tokens


Last synced Apr 24, 2026

NousResearch's Hermes 2 Pro - Llama-3 8B has a context window of 8,192 tokens, enough for short-document summarization and multi-turn chat; longer inputs have to be chunked or truncated to fit. Input and output are both priced at $0.14 per million tokens, so cost scales linearly with total token volume regardless of the prompt-to-completion ratio. This makes the model a low-cost option for teams adding language capabilities to a product.

📋 Structured Output

Context Window

8,192

Input tokens

Full-context input ≈ $0.0011

Max Output

8,192

Completion tokens

Input Price / 1M

$0.1400

Prompt tokens

Output Price / 1M

$0.1400

Completion tokens
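At these rates a single request that fills the whole context window costs a fraction of a cent. A minimal sketch of the arithmetic, using only the prices and context size listed on this page; the helper name `request_cost` is hypothetical and not part of any provider API:

```python
from decimal import Decimal

# Figures copied from this page (USD per 1M tokens, tokens per window).
INPUT_PRICE_PER_M = Decimal("0.14")
OUTPUT_PRICE_PER_M = Decimal("0.14")
CONTEXT_WINDOW = 8192


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the USD cost of one request (hypothetical helper)."""
    million = Decimal(1_000_000)
    return (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / million


# Cost of a prompt that fills the entire 8,192-token window.
full_input = request_cost(CONTEXT_WINDOW, 0)
print(f"Full-context input: ${full_input:.6f}")  # Full-context input: $0.001147
```

Because the input and output rates are identical, only the total token count of a request affects its cost.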

Top Benchmark

Pending

No benchmark data yet

Price History

NousResearch: Hermes 2 Pro - Llama-3 8B Pricing Trend

Input / 1M tokens: 0.0% · Output / 1M tokens: 0.0%

Current Input / 1M

$0.1400

Current Output / 1M

$0.1400

Performance History

NousResearch: Hermes 2 Pro - Llama-3 8B Speed Trend

Tokens/sec (higher is better) · Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.1400
Output (Completion)	$0.1400


Cost Calculator

Estimate monthly spend for NousResearch: Hermes 2 Pro - Llama-3 8B based on your workload.

Input tokens / month

0 to 1.0B

Output tokens / month

0 to 500M

Estimated Monthly Cost

$5.18

25M input + 12M output tokens
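The estimate above can be reproduced by hand: multiply each monthly token count by its per-million rate and sum the two. A minimal sketch, assuming the flat $0.14 rates listed on this page with no volume discounts or minimum charges; `monthly_cost` is a hypothetical helper name:

```python
from decimal import Decimal, ROUND_HALF_UP

# Rates copied from the pricing table on this page (USD per 1M tokens).
PRICE_PER_M = {"input": Decimal("0.14"), "output": Decimal("0.14")}


def monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate monthly spend in USD, rounded to cents (hypothetical helper)."""
    million = Decimal(1_000_000)
    total = (
        Decimal(input_tokens) * PRICE_PER_M["input"]
        + Decimal(output_tokens) * PRICE_PER_M["output"]
    ) / million
    return total.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


# The workload shown in the calculator: 25M input + 12M output tokens.
print(monthly_cost(25_000_000, 12_000_000))  # 5.18
```

Under the same assumptions, the calculator's upper bounds (1.0B input and 500M output tokens) come to $210.00 per month.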

Same Workload on Other Models

Model	Monthly Cost	Difference
Baidu: Qianfan-OCR-Fast (free)	$0.00	−$5.18
Free Models Router	$0.00	−$5.18
Google: Gemma 3 12B (free)	$0.00	−$5.18
Google: Gemma 3 27B (free)	$0.00	−$5.18

