Model Cost Profile

NVIDIA: Nemotron Nano 9B V2

Developer: nvidia · Tokenizer: Other · Quantization: bf16

Canonical ID: nvidia/nemotron-nano-9b-v2

Pricing updated Apr 22, 2026

Input rank: #42 · Output rank: #57

Live Pricing

Input: $0.0400

Output: $0.1600


Last synced Apr 22, 2026 · MMLU score via public benchmark data

NVIDIA's Nemotron Nano 9B V2 has a context window of 131,072 tokens, which suits long-input workloads such as document summarization and multi-turn conversational agents. It is priced at $0.04 per 1M input tokens and $0.16 per 1M output tokens, so each output token costs four times as much as an input token. Workloads that read long inputs and produce short outputs, such as summarization, get the most out of this pricing.
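As a rough illustration of the figures above, the sketch below computes what a single prompt that fills the entire context window would cost at the listed input price. The function name `prompt_cost_usd` is hypothetical, and the constants are copied from this page rather than fetched from any API.

```python
# Figures copied from this page (assumed current as of the listed sync date).
CONTEXT_WINDOW_TOKENS = 131_072
INPUT_PRICE_PER_M = 0.04  # USD per 1M prompt tokens


def prompt_cost_usd(prompt_tokens: int,
                    price_per_million: float = INPUT_PRICE_PER_M) -> float:
    """Return the input-side cost in USD for a prompt of the given size."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return prompt_tokens / 1_000_000 * price_per_million


# Cost of one request whose prompt fills the whole context window.
full_context_cost = prompt_cost_usd(CONTEXT_WINDOW_TOKENS)
print(f"Full-context input: ${full_context_cost:.4f}")
```

The result is about half a cent, which is why the page's summary card shows it rounded to $0.01.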

🔧 Tool Calling · 🔌 MCP Compatible · 📋 Structured Output · 🧠 Reasoning

Context Window

131,072

Input tokens

Full-context input ≈ $0.0052 (131,072 tokens at $0.04 per 1M; displayed as $0.01 when rounded to cents)

Max Output

—

Not specified

Input Price / 1M

$0.0400

Prompt tokens

Output Price / 1M

$0.1600

Completion tokens

Top Benchmark

73.9

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for NVIDIA: Nemotron Nano 9B V2. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	55.7	#81 of 125	artificial_analysis
MMLU	73.9	#80 of 127	artificial_analysis

Price History

NVIDIA: Nemotron Nano 9B V2 Pricing Trend

Input / 1M tokens: 0.0% · Output / 1M tokens: 0.0%

Current Input / 1M

$0.0400

Current Output / 1M

$0.1600

Performance History

NVIDIA: Nemotron Nano 9B V2 Speed Trend

Tokens/sec (higher is better) · Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0400
Output (Completion)	$0.1600


Cost Calculator

Estimate monthly spend for NVIDIA: Nemotron Nano 9B V2 based on your workload.

Input tokens / month

0 to 1.0B

Output tokens / month

0 to 500M

Estimated Monthly Cost

$2.92

25M input + 12M output tokens
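The estimate above can be reproduced with simple per-million arithmetic. The following is a minimal sketch, assuming the calculator just multiplies each token volume by its listed price and sums the two; the function name `monthly_cost_usd` is hypothetical, and the prices are the ones shown on this page.

```python
INPUT_PRICE_PER_M = 0.04   # USD per 1M prompt tokens (from this page)
OUTPUT_PRICE_PER_M = 0.16  # USD per 1M completion tokens (from this page)


def monthly_cost_usd(input_tokens: int, output_tokens: int,
                     input_price_per_m: float = INPUT_PRICE_PER_M,
                     output_price_per_m: float = OUTPUT_PRICE_PER_M) -> float:
    """Estimate monthly spend in USD from token volumes and per-1M prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


# The workload shown in the calculator: 25M input + 12M output tokens.
estimate = monthly_cost_usd(25_000_000, 12_000_000)
print(f"Estimated monthly cost: ${estimate:.2f}")
```

For this workload the input side contributes $1.00 and the output side $1.92, so output tokens account for roughly two thirds of the bill despite being less than half the volume.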

Same Workload on Other Models

Model	Monthly Cost	Difference
Arcee AI: Trinity Large Preview (free)	$0.00	−$2.92
Free Models Router	$0.00	−$2.92
Google: Gemma 3 12B (free)	$0.00	−$2.92
Google: Gemma 3 27B (free)	$0.00	−$2.92

