Context Window
262,144
Tokens
Model Cost Profile
Developer: nvidia
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.0500
Output: $0.2000
Pricing via OpenRouter API ยท Last synced Mar 11, 2026 ยท MMLU score via public benchmark data
The NVIDIA Nemotron 3 Nano 30B A3B model features an extensive context window of 262,144 tokens, making it well-suited for applications requiring in-depth analysis and long-form content generation. Teams leveraging this API can benefit from a competitive input cost of $0.05 per 1 million tokens and an output cost of $0.20 per 1 million tokens, allowing for scalable usage in projects like chatbots, document summarization, and complex data processing. This pricing structure enables organizations to optimize their budget while harnessing advanced AI capabilities for diverse business needs.
Context Window
262,144
Tokens
Input Price / 1M
$0.0500
Prompt tokens
Output Price / 1M
$0.2000
Completion tokens
Intelligence (MMLU)
57.9
Massive Multitask Language Understanding
Standardized evaluation scores for NVIDIA: Nemotron 3 Nano 30B A3B.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0500 |
| Output (Completion) | $0.2000 |
Price History
Current Input / 1M
$0.0500
Current Output / 1M
$0.2000
Estimate monthly spend for NVIDIA: Nemotron 3 Nano 30B A3B based on your workload.
Estimated Monthly Cost
$3.65
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for NVIDIA: Nemotron 3 Nano 30B A3B.
NVIDIA: Nemotron 3 Nano 30B A3B input pricing is $0.0500 per 1M tokens based on the latest synced provider data.
NVIDIA: Nemotron 3 Nano 30B A3B output pricing is $0.2000 per 1M tokens based on the latest synced provider data.
NVIDIA: Nemotron 3 Nano 30B A3B supports a context window of 262,144 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.