How do you optimize latency for Conversational AI?

Latency is what separates good Conversational AI applications from great ones.

Diagram of a speech processing system showing data flow from user input to output speech, including components like telephone network, ASR, VAD, LLM, TTS, and latency indicators.