Context Window
131,072
Tokens
Model Cost Profile
Developer: nvidia
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.2000
Output: $0.6000
Pricing via OpenRouter API ยท Last synced Mar 11, 2026 ยท MMLU score via public benchmark data
The NVIDIA Nemotron Nano 12B 2 VL model offers a substantial context window of 131,072 tokens, making it suitable for applications requiring extensive input, such as long-form content generation and complex data analysis. With an input price of $0.07 per million tokens and an output price of $0.20 per million tokens, teams can effectively manage costs while leveraging its capabilities for large-scale projects. This model is ideal for organizations looking to enhance their natural language processing tasks, including chatbots and document summarization, without incurring prohibitive expenses.
Context Window
131,072
Tokens
Input Price / 1M
$0.2000
Prompt tokens
Output Price / 1M
$0.6000
Completion tokens
Intelligence (MMLU)
64.9
Massive Multitask Language Understanding
Standardized evaluation scores for NVIDIA: Nemotron Nano 12B 2 VL.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.2000 |
| Output (Completion) | $0.6000 |
Price History
Current Input / 1M
$0.2000
Current Output / 1M
$0.6000
Estimate monthly spend for NVIDIA: Nemotron Nano 12B 2 VL based on your workload.
Estimated Monthly Cost
$12
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for NVIDIA: Nemotron Nano 12B 2 VL.
NVIDIA: Nemotron Nano 12B 2 VL input pricing is $0.2000 per 1M tokens based on the latest synced provider data.
NVIDIA: Nemotron Nano 12B 2 VL output pricing is $0.6000 per 1M tokens based on the latest synced provider data.
NVIDIA: Nemotron Nano 12B 2 VL supports a context window of 131,072 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.