Model Cost Profile

NVIDIA: Nemotron 3 Nano 30B A3B

Developer: nvidia · Tokenizer: Other · Quantization: fp4

Canonical ID: nvidia/nemotron-3-nano-30b-a3b

Pricing updated Apr 23, 2026

Input rank: #47 · Output rank: #66

Live Pricing

Input: $0.0500

Output: $0.2000

Last synced Apr 23, 2026 · MMLU score via public benchmark data

The NVIDIA Nemotron 3 Nano 30B A3B model has a context window of 262,144 tokens, which suits long-document analysis and long-form content generation. Input costs $0.05 per 1 million tokens and output costs $0.20 per 1 million tokens, so output is billed at four times the input rate. Typical workloads include chatbots, document summarization, and complex data processing.

🔧 Tool Calling · 🔌 MCP Compatible · 📋 Structured Output · 🧠 Reasoning

Context Window

262,144

Input tokens

Full-context input ≈ $0.01
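As a worked check of the full-context figure, assuming the entire 262,144-token window is filled with prompt tokens and billed at the listed input rate of $0.05 per 1M tokens:

```latex
\[
262{,}144 \ \text{tokens} \times \frac{\$0.05}{1{,}000{,}000 \ \text{tokens}}
  = \$0.0131072 \approx \$0.01
\]
```

The page's $0.01 is this value rounded to the nearest cent; the unrounded cost is closer to 1.3 cents per full-context prompt.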

Max Output

—

Not specified

Input Price / 1M

$0.0500

Prompt tokens

Output Price / 1M

$0.2000

Completion tokens

Top Benchmark

79.4

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for NVIDIA: Nemotron 3 Nano 30B A3B. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	75.7	#37 of 125	artificial_analysis
MMLU	79.4	#53 of 127	artificial_analysis

Price History

NVIDIA: Nemotron 3 Nano 30B A3B Pricing Trend

Input / 1M tokens: 0.0% · Output / 1M tokens: 0.0%

Current Input / 1M

$0.0500

Current Output / 1M

$0.2000

Performance History

NVIDIA: Nemotron 3 Nano 30B A3B Speed Trend

Tokens/sec (higher is better) · Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0500
Output (Completion)	$0.2000

Compare with NVIDIA: Nemotron Nano 9B V2 · Compare with Mistral: Mistral Small 3 · Compare with OpenAI: GPT-5 Nano

Cost Calculator

Estimate monthly spend for NVIDIA: Nemotron 3 Nano 30B A3B based on your workload.

Input tokens / month

0 to 1.0B

Output tokens / month

0 to 500M

Estimated Monthly Cost

$3.65

25M input + 12M output tokens
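The estimate above can be reproduced from the two listed prices. The following is a minimal sketch, not the page's actual calculator: the function name `estimate_monthly_cost` and the constant names are hypothetical, and it assumes billing is strictly linear in token counts with no minimums, discounts, or caching.

```python
from decimal import Decimal

# Prices per 1M tokens, taken from the pricing table on this page.
INPUT_PRICE_PER_M = Decimal("0.05")
OUTPUT_PRICE_PER_M = Decimal("0.20")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated monthly spend in USD, rounded to cents.

    Hypothetical helper: assumes cost scales linearly with token counts.
    """
    cost = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    ) / TOKENS_PER_M
    return cost.quantize(Decimal("0.01"))


# The workload shown in the calculator: 25M input + 12M output tokens.
print(estimate_monthly_cost(25_000_000, 12_000_000))  # 3.65
```

Of the $3.65 total, $1.25 comes from input and $2.40 from output, so for this workload the output side dominates even though it has fewer tokens.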

Same Workload on Other Models

Model	Estimated Monthly Cost	Difference
Arcee AI: Trinity Large Preview (free)	$0.00	−$3.65
Free Models Router	$0.00	−$3.65
Google: Gemma 3 12B (free)	$0.00	−$3.65
Google: Gemma 3 27B (free)	$0.00	−$3.65

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

NVIDIA: Nemotron 3 Nano 30B A3B vs Arcee AI: Trinity Large Preview (free)
NVIDIA: Nemotron 3 Nano 30B A3B vs Free Models Router
NVIDIA: Nemotron 3 Nano 30B A3B vs Google: Gemma 3 12B (free)
NVIDIA: Nemotron 3 Nano 30B A3B vs Google: Gemma 3 27B (free)
