Context Window
262,144
Input tokens
Full-context input ≈ $0.00
Model Cost Profile
Developer: nvidia· Tokenizer: Other · Quantization: unknown
Canonical ID: nvidia/nemotron-3-super-120b-a12b-20230311
Pricing updated Apr 29, 2026
Live Pricing
Input: $0.0000
Output: $0.0000
Last synced Apr 29, 2026
The NVIDIA Nemotron 3 Super offers an extensive context window of 262,144 tokens, making it ideal for applications requiring deep contextual understanding, such as long-form content generation and complex data analysis. With a pricing model of $0.00 for both input and output per million tokens, teams can leverage this API without incurring costs, facilitating experimentation and scalability in projects. This model is particularly beneficial for organizations looking to integrate advanced AI capabilities into their workflows without financial barriers, allowing for extensive usage in research, development, and production environments.
Context Window
262,144
Input tokens
Full-context input ≈ $0.00
Max Output
262,144
Completion tokens
Input Price / 1M
$0.0000
Prompt tokens
Output Price / 1M
$0.0000
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.000000
Current Output / 1M
$0.000000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
98.9%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0000 |
| Output (Completion) | $0.0000 |
Estimate monthly spend for NVIDIA: Nemotron 3 Super (free) based on your workload.
Estimated Monthly Cost
$0.00
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.