DOCAWS Machine Learning Blog·20d ago
Build real-time voice applications with Amazon SageMaker AI and vLLM
Real-time voice applications, such as voice agents and live captioning, rely on simultaneous speech-to-text transcription. Traditional request-response inference falls short, introducing latency that hinders real-time functionality.
27