Context Window
128,000
Tokens
Model Cost Profile
Developer: nvidia
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.0000
Output: $0.0000
Pricing via OpenRouter API ยท Last synced Mar 11, 2026 ยท MMLU score via public benchmark data
The NVIDIA Nemotron Nano 9B V2 offers a substantial context window of 128,000 tokens, making it suitable for applications requiring extensive text analysis or long-form content generation. As a free API model, it provides teams with a cost-effective solution for projects that demand high token limits without the burden of input or output fees. This model is ideal for developers looking to integrate advanced natural language processing capabilities into chatbots, document summarization tools, or any application that benefits from processing large volumes of text efficiently.
Context Window
128,000
Tokens
Input Price / 1M
$0.0000
Prompt tokens
Output Price / 1M
$0.0000
Completion tokens
Intelligence (MMLU)
74.2
Massive Multitask Language Understanding
Standardized evaluation scores for NVIDIA: Nemotron Nano 9B V2 (free).
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0000 |
| Output (Completion) | $0.0000 |
Price History
Current Input / 1M
$0.000000
Current Output / 1M
$0.000000
Estimate monthly spend for NVIDIA: Nemotron Nano 9B V2 (free) based on your workload.
Estimated Monthly Cost
$0.00
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for NVIDIA: Nemotron Nano 9B V2 (free).
NVIDIA: Nemotron Nano 9B V2 (free) input pricing is $0.0000 per 1M tokens based on the latest synced provider data.
NVIDIA: Nemotron Nano 9B V2 (free) output pricing is $0.0000 per 1M tokens based on the latest synced provider data.
NVIDIA: Nemotron Nano 9B V2 (free) supports a context window of 128,000 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.