Model Cost Profile
Developer: openai · Tokenizer: GPT · Instruct: chatml · Quantization: unknown
Canonical ID: openai/gpt-3.5-turbo-instruct
Pricing updated Apr 24, 2026
OpenAI's GPT-3.5 Turbo Instruct is an instruction-following language model suited to chatbots, content creation, and code assistance. Its context window is 4,095 tokens, which is small by current standards, so long documents or conversations need to be truncated or summarized before they are sent. Pricing is $1.50 per million input tokens and $2.00 per million output tokens, so spend scales linearly with token volume and can be budgeted directly from expected usage.
Provider Compliance
Context Window
4,095
Input tokens
Full-context input ≈ $0.01
Max Output
4,096
Completion tokens
Input Price / 1M
$1.50
Prompt tokens
Output Price / 1M
$2.00
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$1.50
Current Output / 1M
$2.00
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $1.50 |
| Output (Completion) | $2.00 |
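To make the per-token rates concrete, here is a minimal sketch that converts the two prices in the table above into a per-request cost. The helper name `request_cost` and the constants are illustrative assumptions, not part of any OpenAI SDK; only the two prices and the 4,095-token context size come from this page.

```python
from decimal import Decimal

# Prices from the table above, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("1.50")
OUTPUT_PRICE_PER_M = Decimal("2.00")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the USD cost of a single request (hypothetical helper)."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost) / TOKENS_PER_M


# Cost of a prompt that fills the whole 4,095-token context window.
full_context = request_cost(4095, 0)
print(full_context)
# Rounded to cents, for comparison with the page's "Full-context input" figure.
print(full_context.quantize(Decimal("0.01")))
```

`Decimal` is used here so that small per-request amounts are not distorted by binary floating-point rounding; plain floats would also work for rough estimates.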
Estimate monthly spend for OpenAI: GPT-3.5 Turbo Instruct based on your workload.
Estimated Monthly Cost
$62
25M input + 12M output tokens
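The estimate above can be reproduced with simple arithmetic. The sketch below assumes the workload is expressed in millions of tokens per month and uses the listed prices as defaults; `monthly_cost` is a hypothetical helper name used only for illustration.

```python
def monthly_cost(
    input_millions: float,
    output_millions: float,
    input_price: float = 1.50,   # USD per 1M input tokens (from this page)
    output_price: float = 2.00,  # USD per 1M output tokens (from this page)
) -> float:
    """Return the estimated monthly spend in USD (hypothetical helper)."""
    return input_millions * input_price + output_millions * output_price


# Workload from the estimate above: 25M input + 12M output tokens per month.
estimate = monthly_cost(25, 12)
print(f"${estimate:.2f}")
```

For this workload the input side contributes 25 × $1.50 = $37.50 and the output side 12 × $2.00 = $24.00, for $61.50 in total; the $62 shown above appears to be that figure rounded to the nearest dollar.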