Context Window
128,000
Input tokens
Full-context input โ $0.32
Model Cost Profile
Developer: openaiยท Tokenizer: GPT ยท Quantization: unknown
Canonical ID: openai/gpt-4o-audio-preview
Pricing updated Apr 24, 2026
OpenAI's GPT-4o Audio model is designed for applications requiring extensive context, with a remarkable 128,000 token context window, making it ideal for generating long-form audio content and transcriptions. Teams leveraging this API model can expect input costs of $2.50 per million tokens and output costs of $10.00 per million tokens, which can significantly impact budget planning for high-volume audio processing projects. This model is particularly beneficial for industries such as media, education, and entertainment, where nuanced understanding and generation of audio data are critical for user engagement and content creation.
Provider Compliance
Context Window
128,000
Input tokens
Full-context input โ $0.32
Max Output
16,384
Completion tokens
Input Price / 1M
$2.50
Prompt tokens
Output Price / 1M
$10.00
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$2.50
Current Output / 1M
$10.00
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $2.50 |
| Output (Completion) | $10.00 |
| Audio | $40.00 |
Estimate monthly spend for OpenAI: GPT-4o Audio based on your workload.
Estimated Monthly Cost
$183
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.