Context Window
262,144
Input tokens
Full-context input ≈ $0.02
Model Cost Profile
Developer: nvidia· Tokenizer: Other · Quantization: fp8
Canonical ID: nvidia/nemotron-3-super-120b-a12b-20230311
Pricing updated May 3, 2026
Live Pricing
Input: $0.0900
Output: $0.4500
Last synced May 3, 2026
The NVIDIA Nemotron 3 Super features an extensive context window of 262,144 tokens, making it ideal for applications requiring deep contextual understanding, such as legal document analysis and large-scale data summarization. With an input price of $0.10 per million tokens and an output price of $0.50 per million tokens, teams can effectively manage costs while leveraging its capabilities for complex tasks in natural language processing. This model is particularly beneficial for enterprises that need to process and generate large amounts of text efficiently, optimizing both performance and budget.
Context Window
262,144
Input tokens
Full-context input ≈ $0.02
Max Output
—
Not specified
Input Price / 1M
$0.0900
Prompt tokens
Output Price / 1M
$0.4500
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.0900
Current Output / 1M
$0.4500
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
96.6%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0900 |
| Output (Completion) | $0.4500 |
Estimate monthly spend for NVIDIA: Nemotron 3 Super based on your workload.
Estimated Monthly Cost
$7.65
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.