OpenAI logo

OpenAI

GPT-4o Audio

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.

Input / 1M tokens
$2.50
Output / 1M tokens
$10.00
Context window
128K tokens
Provider
OpenAI
Knowledge cutoff
2023-10-31

Performance

Median streaming throughput and first-token latency measured by Artificial Analysis.

Output tokens / sec
Time to first token