Context Window
131,072
Tokens
Model Cost Profile
Developer: NVIDIA
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.1000 / 1M tokens
Output: $0.4000 / 1M tokens
Pricing via OpenRouter API · Last synced Mar 11, 2026 · MMLU score via public benchmark data
The NVIDIA Llama 3.3 Nemotron Super 49B V1.5 model targets natural language processing tasks such as chatbots, content generation, and data analysis. Its 131,072-token context window lets teams pass long inputs in a single request and keep context across longer conversations. Pricing is $0.10 per million input tokens and $0.40 per million output tokens, so spend scales directly with token volume and can be budgeted per workload.
Input Price / 1M: $0.1000 (prompt tokens)
Output Price / 1M: $0.4000 (completion tokens)
Intelligence (MMLU): 69.8 (Massive Multitask Language Understanding)
Standardized evaluation scores for NVIDIA: Llama 3.3 Nemotron Super 49B V1.5.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1000 |
| Output (Completion) | $0.4000 |
Price History
Current Input / 1M: $0.1000
Current Output / 1M: $0.4000
Estimate monthly spend for NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 based on your workload.
Estimated Monthly Cost
$7.30
25M input + 12M output tokens
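The estimate above follows directly from the listed per-million-token prices. A minimal sketch of the arithmetic, assuming the prices on this page and a hypothetical helper named `estimate_cost` (not part of any provider SDK):

```python
from decimal import Decimal

# Prices per 1M tokens in USD, as listed on this page.
INPUT_PRICE_PER_M = Decimal("0.10")
OUTPUT_PRICE_PER_M = Decimal("0.40")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for the given token counts.

    Hypothetical helper for illustration; Decimal avoids float rounding.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# The example workload shown above: 25M input + 12M output tokens.
monthly = estimate_cost(25_000_000, 12_000_000)
print(f"${monthly:.2f}")  # prints $7.30
```

Here 25M input tokens cost $2.50 and 12M output tokens cost $4.80, so output accounts for most of the bill despite the lower token count.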
Quick links for cost-reduction decisions before production rollout.
Common pricing and benchmark questions for NVIDIA: Llama 3.3 Nemotron Super 49B V1.5.
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 input pricing is $0.1000 per 1M tokens based on the latest synced provider data.
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 output pricing is $0.4000 per 1M tokens based on the latest synced provider data.
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 supports a context window of 131,072 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.
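When sizing requests against the 131,072-token context window listed above, it can help to check the budget before sending. The sketch below assumes the window is shared between prompt and completion tokens, which is common but not confirmed on this page, and that token counts come from the model's own tokenizer (not shown). The function names are hypothetical.

```python
CONTEXT_WINDOW = 131_072  # tokens, as listed on this page


def fits_in_context(prompt_tokens: int, max_output_tokens: int,
                    context_window: int = CONTEXT_WINDOW) -> bool:
    """True if the prompt plus the reserved output fits in the window.

    Assumes prompt and completion share one window (an assumption).
    """
    return prompt_tokens + max_output_tokens <= context_window


def max_output_budget(prompt_tokens: int,
                      context_window: int = CONTEXT_WINDOW) -> int:
    """Tokens left for the completion after the prompt, never negative."""
    return max(0, context_window - prompt_tokens)


print(fits_in_context(120_000, 4_096))  # prints True
print(fits_in_context(130_000, 4_096))  # prints False
print(max_output_budget(130_000))       # prints 1072
```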