Model Cost Profile

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Developer: nvidia

Pricing updated Mar 11, 2026

Input rank: #91Output rank: #105

Live Pricing

Input: $0.1000

Output: $0.4000

Pricing via OpenRouter API ยท Last synced Mar 11, 2026 ยท MMLU score via public benchmark data

The NVIDIA Llama 3.3 Nemotron Super 49B V1.5 model is designed for advanced natural language processing tasks, making it suitable for applications in chatbots, content generation, and data analysis. With an extensive context window of 131,072 tokens, teams can manage larger datasets and maintain context over longer conversations, enhancing user experience and accuracy. The pricing structure, at $0.10 per million tokens for input and $0.40 for output, allows organizations to budget effectively based on their specific usage needs and project scale.

๐Ÿ”ง Tool Calling๐Ÿ“‹ Structured Output๐Ÿง  Reasoning

Context Window

131,072

Tokens

Input Price / 1M

$0.1000

Prompt tokens

Output Price / 1M

$0.4000

Completion tokens

Intelligence (MMLU)

69.8

Massive Multitask Language Understanding

Benchmark Scores

Standardized evaluation scores for NVIDIA: Llama 3.3 Nemotron Super 49B V1.5.

BenchmarkScoreRankSource
GPQA51.7#80 of 118artificial_analysis
MMLU69.8#87 of 121artificial_analysis

Price History

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%
Mar 7 โ€” Mar 11
$0.1000$0.2500$0.4000Mar 7Mar 8Mar 9Mar 10Mar 11

Current Input / 1M

$0.1000

Current Output / 1M

$0.4000

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

FAQ

Common pricing and benchmark questions for NVIDIA: Llama 3.3 Nemotron Super 49B V1.5.

How much does NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 cost per 1M input tokens?

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 input pricing is $0.1000 per 1M tokens based on the latest synced provider data.

How much does NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 cost per 1M output tokens?

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 output pricing is $0.4000 per 1M tokens based on the latest synced provider data.

What context window does NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 support?

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 supports a context window of 131,072 tokens.

How can I compare NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 with cheaper alternatives?

Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.