Context Window
128,000
Tokens
Model Cost Profile
Developer: nvidia
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.0000
Output: $0.0000
Pricing via OpenRouter API ยท Last synced Mar 11, 2026 ยท MMLU score via public benchmark data
The NVIDIA Nemotron Nano 12B 2 VL model offers a substantial context window of 128,000 tokens, making it ideal for applications requiring extensive text analysis or long-form content generation. With no associated input or output costs, teams can leverage this free API model for projects in natural language processing, chatbots, and data extraction without worrying about budget constraints. Its high token capacity allows for complex tasks, enabling users to handle larger datasets and maintain context over extended interactions.
Context Window
128,000
Tokens
Input Price / 1M
$0.0000
Prompt tokens
Output Price / 1M
$0.0000
Completion tokens
Intelligence (MMLU)
75.9
Massive Multitask Language Understanding
Standardized evaluation scores for NVIDIA: Nemotron Nano 12B 2 VL (free).
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0000 |
| Output (Completion) | $0.0000 |
Price History
Current Input / 1M
$0.000000
Current Output / 1M
$0.000000
Estimate monthly spend for NVIDIA: Nemotron Nano 12B 2 VL (free) based on your workload.
Estimated Monthly Cost
$0.00
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for NVIDIA: Nemotron Nano 12B 2 VL (free).
NVIDIA: Nemotron Nano 12B 2 VL (free) input pricing is $0.0000 per 1M tokens based on the latest synced provider data.
NVIDIA: Nemotron Nano 12B 2 VL (free) output pricing is $0.0000 per 1M tokens based on the latest synced provider data.
NVIDIA: Nemotron Nano 12B 2 VL (free) supports a context window of 128,000 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.