Context Window
131,072
Input tokens
Full-context input ≈ $0.03
Model Cost Profile
Developer: meta-llama· Tokenizer: Llama3 · Instruct: llama3 · Quantization: fp8
Canonical ID: meta-llama/llama-3.2-11b-vision-instruct
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.2450
Output: $0.2450
Last synced Apr 24, 2026 · MMLU score via public benchmark data
Meta: Llama 3.2 11B Vision Instruct is designed for teams needing advanced visual understanding and instruction-following capabilities, making it ideal for applications in image analysis and interactive AI systems. With an extensive context window of 131072 tokens, this model supports complex tasks that require significant contextual information, enhancing performance in scenarios like document summarization and multi-turn conversations. At a competitive pricing of $0.05 per million tokens for both input and output, it offers a cost-effective solution for businesses looking to integrate sophisticated AI functionalities into their workflows.
Context Window
131,072
Input tokens
Full-context input ≈ $0.03
Max Output
16,384
Completion tokens
Input Price / 1M
$0.2450
Prompt tokens
Output Price / 1M
$0.2450
Completion tokens
Top Benchmark
46.4
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Meta: Llama 3.2 11B Vision Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.2450
Current Output / 1M
$0.2450
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.2450 |
| Output (Completion) | $0.2450 |
Estimate monthly spend for Meta: Llama 3.2 11B Vision Instruct based on your workload.
Estimated Monthly Cost
$9.07
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.