Model Cost Profile

NVIDIA: Nemotron Nano 12B 2 VL

Developer: nvidia· Tokenizer: Other · Quantization: fp8

Canonical ID: nvidia/nemotron-nano-12b-v2-vl

Pricing updated Apr 23, 2026

Input rank: #137Output rank: #134

Live Pricing

Input: $0.2000

Output: $0.6000

Visit NVIDIA ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 23, 2026 · MMLU score via public benchmark data

The NVIDIA Nemotron Nano 12B 2 VL model offers a substantial context window of 131,072 tokens, making it suitable for applications requiring extensive input, such as long-form content generation and complex data analysis. With an input price of $0.07 per million tokens and an output price of $0.20 per million tokens, teams can effectively manage costs while leveraging its capabilities for large-scale projects. This model is ideal for organizations looking to enhance their natural language processing tasks, including chatbots and document summarization, without incurring prohibitive expenses.

👁 Vision📋 Structured Output🧠 Reasoning

Context Window

131,072

Input tokens

Full-context input ≈ $0.03

Max Output

—

Not specified

Input Price / 1M

$0.2000

Prompt tokens

Output Price / 1M

$0.6000

Completion tokens

Top Benchmark

64.9

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for NVIDIA: Nemotron Nano 12B 2 VL. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	43.9	#108 of 125	artificial_analysis
MMLU	64.9	#110 of 127	artificial_analysis

Price History

NVIDIA: Nemotron Nano 12B 2 VL Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.2000

Current Output / 1M

$0.6000

Performance History

NVIDIA: Nemotron Nano 12B 2 VL Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.2000
Output (Completion)	$0.6000

Compare with NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 Compare with AllenAI: Olmo 3.1 32B Instruct Compare with DeepSeek: DeepSeek V3 0324

Cost Calculator

Estimate monthly spend for NVIDIA: Nemotron Nano 12B 2 VL based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$12

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00−$12 Free Models Router$0.00−$12 Google: Gemma 3 12B (free)$0.00−$12 Google: Gemma 3 27B (free)$0.00−$12

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

NVIDIA: Nemotron Nano 12B 2 VL vs Arcee AI: Trinity Large Preview (free)NVIDIA: Nemotron Nano 12B 2 VL vs Free Models Router NVIDIA: Nemotron Nano 12B 2 VL vs Google: Gemma 3 12B (free)NVIDIA: Nemotron Nano 12B 2 VL vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

43.9

#108 of 125

artificial_analysis

MMLU

64.9

#110 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.2000

Output (Completion)

$0.6000

Cost Calculator

Estimate monthly spend for NVIDIA: Nemotron Nano 12B 2 VL based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$12

25M input + 12M output tokens