Context Window
131,072
Tokens
Model Cost Profile
Developer: nvidia
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.0400
Output: $0.1600
Pricing via OpenRouter API ยท Last synced Mar 11, 2026 ยท MMLU score via public benchmark data
NVIDIA's Nemotron Nano 9B V2 features an extensive context window of 131,072 tokens, making it suitable for applications that require processing large volumes of text, such as document summarization and complex conversational agents. With an input price of $0.04 per 1M tokens and an output price of $0.16 per 1M tokens, teams can optimize their budget while leveraging advanced AI capabilities for projects like content generation and data analysis. This model is particularly beneficial for organizations needing scalable solutions that handle expansive datasets efficiently.
Context Window
131,072
Tokens
Input Price / 1M
$0.0400
Prompt tokens
Output Price / 1M
$0.1600
Completion tokens
Intelligence (MMLU)
74.2
Massive Multitask Language Understanding
Standardized evaluation scores for NVIDIA: Nemotron Nano 9B V2.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0400 |
| Output (Completion) | $0.1600 |
Price History
Current Input / 1M
$0.0400
Current Output / 1M
$0.1600
Estimate monthly spend for NVIDIA: Nemotron Nano 9B V2 based on your workload.
Estimated Monthly Cost
$2.92
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for NVIDIA: Nemotron Nano 9B V2.
NVIDIA: Nemotron Nano 9B V2 input pricing is $0.0400 per 1M tokens based on the latest synced provider data.
NVIDIA: Nemotron Nano 9B V2 output pricing is $0.1600 per 1M tokens based on the latest synced provider data.
NVIDIA: Nemotron Nano 9B V2 supports a context window of 131,072 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.