Audio ingestion
Stream audio input over a WebSocket and keep buffers small: a buffer must fill before it can be sent, so each one adds at least its own duration to end-to-end latency.
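To make the buffer-size tradeoff concrete, here is a minimal sketch of splitting raw PCM audio into small fixed-duration frames before sending them. The format (16 kHz, 16-bit mono), the 20 ms frame length, and the names frame_bytes and iter_frames are illustrative assumptions, not values given in these notes. The WebSocket send itself is left out, since it depends on the client library in use.

```python
from typing import Iterator


def frame_bytes(sample_rate_hz: int, frame_ms: int,
                channels: int = 1, sample_width: int = 2) -> int:
    """Return the size in bytes of one frame of raw PCM audio.

    Defaults assume 16-bit mono samples (an assumption for this sketch).
    """
    samples_per_frame = sample_rate_hz * frame_ms // 1000
    return samples_per_frame * channels * sample_width


def iter_frames(pcm: bytes, size: int) -> Iterator[bytes]:
    """Yield consecutive chunks of at most `size` bytes from `pcm`.

    The final chunk may be shorter if the input is not a multiple of `size`.
    """
    for start in range(0, len(pcm), size):
        yield pcm[start:start + size]


# Assumed example: 16 kHz audio in 20 ms frames gives 320 samples, 640 bytes.
FRAME_SIZE = frame_bytes(16_000, 20)
```

Each frame from iter_frames would then be sent as one binary WebSocket message. Shorter frames lower the buffering delay but increase the number of messages per second, so the frame length is a tuning knob and not a fixed rule.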
Turn taking
Detect when the user interrupts while the agent is speaking, and decide whether to cancel the speech output or resume it.
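One way to frame the cancel-or-resume decision is as a small state machine driven by per-frame voice activity results. The sketch below is one possible policy under stated assumptions, not a prescribed design: the class name BargeInDetector, the action strings, and the default threshold of 10 consecutive speech frames are all hypothetical. It also assumes an upstream voice activity detector supplies one boolean per frame while the agent is speaking.

```python
class BargeInDetector:
    """Decide what to do with agent speech for each incoming VAD frame.

    Actions returned by update():
      "continue" - no user speech, keep playing
      "pause"    - user speech started, hold playback while we wait
      "resume"   - speech stopped before the threshold, treat it as noise
      "cancel"   - speech was sustained, drop the rest of the agent's output
    """

    def __init__(self, min_speech_frames: int = 10) -> None:
        # Assumed threshold: with 20 ms frames, 10 frames is 200 ms of speech.
        self.min_speech_frames = min_speech_frames
        self.run = 0            # consecutive frames containing user speech
        self.cancelled = False  # stays True until reset() is called

    def update(self, user_speech: bool) -> str:
        if self.cancelled:
            return "cancel"
        if user_speech:
            self.run += 1
            if self.run >= self.min_speech_frames:
                self.cancelled = True
                return "cancel"
            return "pause"
        was_paused = self.run > 0
        self.run = 0
        return "resume" if was_paused else "continue"

    def reset(self) -> None:
        """Call when the agent starts a new utterance."""
        self.run = 0
        self.cancelled = False
```

A lower threshold reacts faster to real interruptions but cancels more often on coughs or background noise. A higher one does the opposite.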
Latency budget
Measure the latency of every hop in the pipeline and keep the end-to-end total under your UX target.
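Per-hop measurement can be as simple as wrapping each stage in a timer and summing the results. The following is a minimal sketch: the class name LatencyBudget and the hop names and target value in the usage note are hypothetical placeholders. It also assumes the hops run sequentially in one process. Hops that cross the network or overlap in time would need timestamps carried with the audio instead.

```python
import time
from contextlib import contextmanager
from typing import Dict, Iterator


class LatencyBudget:
    """Accumulate wall-clock time per named hop and compare to a target."""

    def __init__(self, target_ms: float) -> None:
        self.target_ms = target_ms
        self.hops: Dict[str, float] = {}  # hop name -> elapsed milliseconds

    @contextmanager
    def hop(self, name: str) -> Iterator[None]:
        """Time the enclosed block and add it to the hop's running total."""
        start = time.perf_counter()
        try:
            yield
        finally:
            elapsed_ms = (time.perf_counter() - start) * 1000.0
            self.hops[name] = self.hops.get(name, 0.0) + elapsed_ms

    def total_ms(self) -> float:
        return sum(self.hops.values())

    def over_budget(self) -> bool:
        return self.total_ms() > self.target_ms
```

In use, each stage would be wrapped in its own block, for example `with budget.hop("asr"): ...`, after creating `budget = LatencyBudget(target_ms=800)`. The 800 ms figure is only a placeholder for whatever UX target applies. Inspecting `budget.hops` afterwards shows which hop consumes most of the total.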