Model · OpenAI · Text
GPT-4o
OpenAI's flagship multimodal model — text, vision, and realtime voice in one model. The default "omni" frontier model.
- Modality
- Text
- License
- Proprietary (Proprietary)
- Context window
- 128,000 tokens
- Released
- May 13, 2024
- Last verified
- May 10, 2026
- Runs locally
- No
- Also handles
- vision, realtime-voice
Strengths
- Native multimodal (text + image + audio)
- Realtime voice surface (200ms response)
- Strong instruction following across modalities
Weaknesses
- Closed weights — cannot be self-hosted
- Realtime API has a separate access tier
Try it
| Where | Type | Notes |
|---|---|---|
| OpenAI API | hosted-api | API key required |
| Azure OpenAI | hosted-api | Enterprise / regional |
| ChatGPT | product | Free tier |
Used in solutions
Version history
- GPT-4o mini Jul 2024
- GPT-4o May 2024 Current
- GPT-4 Turbo Apr 2024 Deprecated
Official sources
- Model card model
- API docs docs
Change log
- — Initial entry.
Esc