Model Cost Profile

Meta: Llama 3.2 11B Vision Instruct

Developer: meta-llama · Tokenizer: Llama3 · Instruct: llama3 · Quantization: fp8

Canonical ID: meta-llama/llama-3.2-11b-vision-instruct

Pricing updated Apr 24, 2026

Input rank: #149 · Output rank: #73

Live Pricing

Input: $0.2450

Output: $0.2450


Last synced Apr 24, 2026 · MMLU score via public benchmark data

Meta: Llama 3.2 11B Vision Instruct is an instruction-tuned model with image input, aimed at teams building image analysis and interactive assistant applications. Its context window of 131,072 tokens leaves room for long inputs, which helps with tasks such as document summarization and multi-turn conversations. It is currently priced at $0.2450 per million tokens for both input and output.

👁 Vision · 📋 Structured Output

Context Window

131,072

Input tokens

Full-context input ≈ $0.03
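The full-context figure can be reproduced from the numbers on this page. The sketch below assumes the estimate is simply the context window multiplied by the listed input price of $0.2450 per 1M tokens; the function name is illustrative and not part of any API.

```python
from decimal import Decimal

# Values taken from this page.
CONTEXT_WINDOW_TOKENS = 131_072
INPUT_PRICE_PER_MILLION = Decimal("0.2450")  # USD per 1M prompt tokens


def full_context_input_cost(tokens: int, price_per_million: Decimal) -> Decimal:
    """Return the USD cost of one prompt containing `tokens` tokens."""
    return Decimal(tokens) * price_per_million / Decimal(1_000_000)


cost = full_context_input_cost(CONTEXT_WINDOW_TOKENS, INPUT_PRICE_PER_MILLION)
print(f"Full-context input: ${cost:.4f}")  # rounds to about $0.03
```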

Max Output

16,384

Completion tokens

Input Price / 1M

$0.2450

Prompt tokens

Output Price / 1M

$0.2450

Completion tokens

Top Benchmark

46.4

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Meta: Llama 3.2 11B Vision Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	22.1	#123 of 125	artificial_analysis
MMLU	46.4	#121 of 127	artificial_analysis
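As a small illustration of how the "Top Benchmark" value is described, the sketch below takes the highest score among the benchmarks that have an entry in the table. Only GPQA and MMLU are listed for this model; the variable names are illustrative.

```python
# Scores from the table above; MATH and HumanEval have no entry for this model.
scores = {"GPQA": 22.1, "MMLU": 46.4}

# Pick the benchmark with the highest reported score.
top_name, top_score = max(scores.items(), key=lambda item: item[1])
print(f"Top benchmark: {top_name} ({top_score})")
```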

Price History

Meta: Llama 3.2 11B Vision Instruct Pricing Trend

Input / 1M tokens: +400.0% · Output / 1M tokens: +400.0%

Current Input / 1M

$0.2450

Current Output / 1M

$0.2450
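The trend percentage and the current price together imply an earlier price. The page does not state that earlier price, so the sketch below is an inference: it assumes the +400.0% change is measured against a single prior price per 1M tokens. The function name is illustrative.

```python
from decimal import Decimal


def price_before_change(current: Decimal, pct_change: Decimal) -> Decimal:
    """Invert a percentage change: current = prior * (1 + pct_change / 100)."""
    return current / (Decimal(1) + pct_change / Decimal(100))


# Current price and trend value as listed on this page.
prior = price_before_change(Decimal("0.2450"), Decimal("400.0"))
print(f"Implied earlier price: ${prior} per 1M tokens")
```

Under that assumption the implied earlier price is $0.049 per 1M tokens, for input and output alike.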

Performance History

Meta: Llama 3.2 11B Vision Instruct Speed Trend

Tokens/sec (higher is better) · Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.2450
Output (Completion)	$0.2450

Compare with Meta: Llama Guard 4 12B
Compare with Anthropic: Claude 3 Haiku
Compare with ByteDance Seed: Seed 1.6

Cost Calculator

Estimate monthly spend for Meta: Llama 3.2 11B Vision Instruct based on your workload.

Input tokens / month

0 to 1.0B

Output tokens / month

0 to 500M

Estimated Monthly Cost

$9.07

25M input + 12M output tokens
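The estimate above can be checked by hand. This is a minimal sketch, assuming the calculator multiplies each token count by its listed price per 1M tokens and rounds the total half-up to cents; the rounding rule and the function name are assumptions, not documented behavior of the page.

```python
from decimal import Decimal, ROUND_HALF_UP

# Listed prices, USD per 1M tokens.
INPUT_PRICE = Decimal("0.2450")
OUTPUT_PRICE = Decimal("0.2450")
MILLION = Decimal(1_000_000)


def monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated monthly spend in USD, rounded to cents."""
    total = (Decimal(input_tokens) / MILLION) * INPUT_PRICE
    total += (Decimal(output_tokens) / MILLION) * OUTPUT_PRICE
    # Decimal avoids binary floating-point surprises at the half-cent boundary.
    return total.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


estimate = monthly_cost(25_000_000, 12_000_000)
print(f"Estimated monthly cost: ${estimate}")
```

With 25M input and 12M output tokens the unrounded total is $9.065, which rounds to the $9.07 shown.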

Same Workload on Other Models

Model	Monthly Cost	Difference
Baidu: Qianfan-OCR-Fast (free)	$0.00	−$9.07
Free Models Router	$0.00	−$9.07
Google: Gemma 3 12B (free)	$0.00	−$9.07
Google: Gemma 3 27B (free)	$0.00	−$9.07

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Meta: Llama 3.2 11B Vision Instruct vs Baidu: Qianfan-OCR-Fast (free)
Meta: Llama 3.2 11B Vision Instruct vs Free Models Router
Meta: Llama 3.2 11B Vision Instruct vs Google: Gemma 3 12B (free)
Meta: Llama 3.2 11B Vision Instruct vs Google: Gemma 3 27B (free)
