Model Cost Profile

NVIDIA: Nemotron 3 Super

Developer: nvidia· Tokenizer: Other · Quantization: fp8

Canonical ID: nvidia/nemotron-3-super-120b-a12b-20230311

Pricing updated May 3, 2026

Input rank: #80Output rank: #120

Live Pricing

Input: $0.0900

Output: $0.4500

Visit NVIDIA ↗HuggingFace ↗View full pricing leaderboard

Last synced May 3, 2026

The NVIDIA Nemotron 3 Super features an extensive context window of 262,144 tokens, making it ideal for applications requiring deep contextual understanding, such as legal document analysis and large-scale data summarization. With an input price of $0.10 per million tokens and an output price of $0.50 per million tokens, teams can effectively manage costs while leveraging its capabilities for complex tasks in natural language processing. This model is particularly beneficial for enterprises that need to process and generate large amounts of text efficiently, optimizing both performance and budget.

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Context Window

262,144

Input tokens

Full-context input ≈ $0.02

Max Output

—

Not specified

Input Price / 1M

$0.0900

Prompt tokens

Output Price / 1M

$0.4500

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

NVIDIA: Nemotron 3 Super Pricing Trend

Input / 1M tokens-70.0%Output / 1M tokens-50.0%

Current Input / 1M

$0.0900

Current Output / 1M

$0.4500

Performance History

NVIDIA: Nemotron 3 Super Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

96.6%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0900
Output (Completion)	$0.4500

Compare with NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 Compare with Qwen: Qwen3 30B A3B Instruct 2507 Compare with Qwen: Qwen3 Next 80B A3B Instruct

Cost Calculator

Estimate monthly spend for NVIDIA: Nemotron 3 Super based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$7.65

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$7.65 Free Models Router$0.00−$7.65 Google: Gemma 3 12B (free)$0.00−$7.65 Google: Gemma 3 27B (free)$0.00−$7.65

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

NVIDIA: Nemotron 3 Super vs Baidu: Qianfan-OCR-Fast (free)NVIDIA: Nemotron 3 Super vs Free Models Router NVIDIA: Nemotron 3 Super vs Google: Gemma 3 12B (free)NVIDIA: Nemotron 3 Super vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0900

Output (Completion)

$0.4500

Cost Calculator

Estimate monthly spend for NVIDIA: Nemotron 3 Super based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$7.65

25M input + 12M output tokens