Model Cost Profile

Nous: Hermes 3 70B Instruct

Developer: nousresearch · Tokenizer: Llama3 · Instruct: chatml · Quantization: fp8

Canonical ID: nousresearch/hermes-3-llama-3.1-70b

Pricing updated Apr 24, 2026

Input rank: #177 · Output rank: #87

Live Pricing

Input: $0.3000

Output: $0.3000


Last synced Apr 24, 2026 · MMLU score via public benchmark data

Nous: Hermes 3 70B Instruct, developed by nousresearch, has a context window of 131,072 tokens, which suits long-form content generation, analysis of long documents, and multi-turn conversation. Input and output are both priced at $0.30 per 1 million tokens, so the cost of a request depends only on its total token count, not on the split between prompt and completion. Typical uses include high-volume processing, customer service, and other interactive applications.

📋 Structured Output

Context Window

131,072

Input tokens

Full-context input ≈ $0.04

Max Output

—

Not specified

Input Price / 1M

$0.3000

Prompt tokens

Output Price / 1M

$0.3000

Completion tokens

Top Benchmark

66.4

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Nous: Hermes 3 70B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	49.1	#94 of 125	artificial_analysis
MMLU	66.4	#106 of 127	artificial_analysis
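
As a small illustration of how the "Top Benchmark" figure is described, the sketch below takes the highest score among the benchmarks reported in the table. The dictionary and variable names are hypothetical, and only GPQA and MMLU are included because this page lists no MATH or HumanEval scores.

```python
# Scores copied from the table above; MATH and HumanEval are not reported here.
scores = {"GPQA": 49.1, "MMLU": 66.4}

# Pick the benchmark with the highest score.
top_name, top_score = max(scores.items(), key=lambda item: item[1])
print(f"Top benchmark: {top_name} ({top_score})")
```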

Price History

Nous: Hermes 3 70B Instruct Pricing Trend

Input / 1M tokens: 0.0% · Output / 1M tokens: 0.0%

Current Input / 1M

$0.3000

Current Output / 1M

$0.3000

Performance History

Nous: Hermes 3 70B Instruct Speed Trend

Tokens/sec (higher is better) · Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.3000
Output (Completion)	$0.3000


Cost Calculator

Estimate monthly spend for Nous: Hermes 3 70B Instruct based on your workload.

Input tokens / month

Range: 0 to 1.0B

Output tokens / month

Range: 0 to 500M

Estimated Monthly Cost

$11

25M input + 12M output tokens
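
The estimate above can be reproduced with a minimal sketch, assuming cost is simply token volume multiplied by the per-million price listed on this page. The function and constant names are hypothetical, and the page's own calculator may round or bill differently.

```python
from decimal import Decimal

# Prices as listed on this page, in USD per 1 million tokens.
INPUT_PRICE_PER_M = Decimal("0.30")
OUTPUT_PRICE_PER_M = Decimal("0.30")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for the given token volumes."""
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Workload shown in the calculator: 25M input + 12M output tokens per month.
monthly = estimate_cost(25_000_000, 12_000_000)
# A single prompt that fills the 131,072-token context window.
full_context = estimate_cost(131_072, 0)

print(f"Monthly: ${monthly:.2f}")
print(f"Full-context input: ${full_context:.4f}")
```

Under these assumptions the monthly figure comes to $11.10, which the page displays as $11, and a full-context prompt costs about $0.04.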

Same Workload on Other Models

Model	Monthly Cost	Difference
Baidu: Qianfan-OCR-Fast (free)	$0.00	−$11
Free Models Router	$0.00	−$11
Google: Gemma 3 12B (free)	$0.00	−$11
Google: Gemma 3 27B (free)	$0.00	−$11

