Blog posts tagged
Observability

Browse posts by date or tag

Average response time metrics for OpenAI GPT‑5 models (by reasoning level)

A practical look at typical end‑to‑end latencies when calling GPT‑5 models via the OpenAI API, how reasoning level affects time‑to‑first‑token and total completion time, and what you can do to measure and optimise it.