Model Cost Profile

NVIDIA: Nemotron 3 Nano 30B A3B (free)

Developer: nvidia· Tokenizer: Other · Quantization: bf16

Canonical ID: nvidia/nemotron-3-nano-30b-a3b

Pricing updated Apr 24, 2026

Input rank: #20Output rank: #20

Live Pricing

Input: $0.0000

Output: $0.0000

Visit NVIDIA ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

The NVIDIA Nemotron 3 Nano 30B A3B model offers a substantial context window of 256,000 tokens, making it ideal for applications requiring extensive text comprehension and generation, such as document summarization and conversational AI. Given its free access, teams can leverage this model without incurring input or output costs, significantly lowering the barrier for experimentation and deployment in various projects. This model is particularly suitable for startups and research teams looking to integrate advanced AI capabilities without budget constraints, facilitating innovation in natural language processing tasks.

🔧 Tool Calling🔌 MCP Compatible🧠 Reasoning

Context Window

256,000

Input tokens

Full-context input ≈ $0.00

Max Output

—

Not specified

Input Price / 1M

$0.0000

Prompt tokens

Output Price / 1M

$0.0000

Completion tokens

Top Benchmark

79.4

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for NVIDIA: Nemotron 3 Nano 30B A3B (free). The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	75.7	#36 of 125	artificial_analysis
MMLU	79.4	#49 of 127	artificial_analysis

Price History

NVIDIA: Nemotron 3 Nano 30B A3B (free) Pricing Trend

Input / 1M tokensOutput / 1M tokens

Current Input / 1M

$0.000000

Current Output / 1M

$0.000000

Performance History

NVIDIA: Nemotron 3 Nano 30B A3B (free) Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0000
Output (Completion)	$0.0000

Compare with NVIDIA: Nemotron 3 Super (free)Compare with Baidu: Qianfan-OCR-Fast (free)Compare with Free Models Router

Cost Calculator

Estimate monthly spend for NVIDIA: Nemotron 3 Nano 30B A3B (free) based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$0.00

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00 Free Models Router$0.00 Google: Gemma 3 12B (free)$0.00 Google: Gemma 3 27B (free)$0.00

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

NVIDIA: Nemotron 3 Nano 30B A3B (free) vs Baidu: Qianfan-OCR-Fast (free)NVIDIA: Nemotron 3 Nano 30B A3B (free) vs Free Models Router NVIDIA: Nemotron 3 Nano 30B A3B (free) vs Google: Gemma 3 12B (free)NVIDIA: Nemotron 3 Nano 30B A3B (free) vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

75.7

#36 of 125

artificial_analysis

MMLU

79.4

#49 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0000

Output (Completion)

$0.0000

Cost Calculator

Estimate monthly spend for NVIDIA: Nemotron 3 Nano 30B A3B (free) based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$0.00

25M input + 12M output tokens