Model Cost Profile

NVIDIA: Nemotron Nano 12B 2 VL (free)

Developer: nvidia· Tokenizer: Other · Quantization: unknown

Canonical ID: nvidia/nemotron-nano-12b-v2-vl

Pricing updated Apr 23, 2026

Input rank: #21Output rank: #21

Live Pricing

Input: $0.0000

Output: $0.0000

Visit NVIDIA ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 23, 2026 · MMLU score via public benchmark data

The NVIDIA Nemotron Nano 12B 2 VL model offers a substantial context window of 128,000 tokens, making it ideal for applications requiring extensive text analysis or long-form content generation. With no associated input or output costs, teams can leverage this free API model for projects in natural language processing, chatbots, and data extraction without worrying about budget constraints. Its high token capacity allows for complex tasks, enabling users to handle larger datasets and maintain context over extended interactions.

👁 Vision🔧 Tool Calling🔌 MCP Compatible🧠 Reasoning

Context Window

128,000

Input tokens

Full-context input ≈ $0.00

Max Output

128,000

Completion tokens

Input Price / 1M

$0.0000

Prompt tokens

Output Price / 1M

$0.0000

Completion tokens

Top Benchmark

75.9

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for NVIDIA: Nemotron Nano 12B 2 VL (free). The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	57.2	#78 of 125	artificial_analysis
MMLU	75.9	#69 of 127	artificial_analysis

Price History

NVIDIA: Nemotron Nano 12B 2 VL (free) Pricing Trend

Input / 1M tokensOutput / 1M tokens

Current Input / 1M

$0.000000

Current Output / 1M

$0.000000

Performance History

NVIDIA: Nemotron Nano 12B 2 VL (free) Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

98.3%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0000
Output (Completion)	$0.0000

Compare with NVIDIA: Nemotron 3 Nano 30B A3B (free)Compare with Arcee AI: Trinity Large Preview (free)Compare with Free Models Router

Cost Calculator

Estimate monthly spend for NVIDIA: Nemotron Nano 12B 2 VL (free) based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$0.00

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00 Free Models Router$0.00 Google: Gemma 3 12B (free)$0.00 Google: Gemma 3 27B (free)$0.00

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

NVIDIA: Nemotron Nano 12B 2 VL (free) vs Arcee AI: Trinity Large Preview (free)NVIDIA: Nemotron Nano 12B 2 VL (free) vs Free Models Router NVIDIA: Nemotron Nano 12B 2 VL (free) vs Google: Gemma 3 12B (free)NVIDIA: Nemotron Nano 12B 2 VL (free) vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

57.2

#78 of 125

artificial_analysis

MMLU

75.9

#69 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0000

Output (Completion)

$0.0000

Cost Calculator

Estimate monthly spend for NVIDIA: Nemotron Nano 12B 2 VL (free) based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$0.00

25M input + 12M output tokens