Overview
AWSNovaSonicLLMService
enables natural, real-time conversations with AWS Nova Sonic. It provides built-in audio transcription, voice activity detection, and context management for creating interactive AI experiences with bidirectional audio streaming, text generation, and function calling capabilities.
AWS Nova Sonic API Reference
Pipecat’s API methods for AWS Nova Sonic integration
Example Implementation
Complete AWS Nova Sonic conversation example
AWS Bedrock Documentation
Official AWS Bedrock and Nova Sonic documentation
AWS Console
Access AWS Bedrock and manage Nova Sonic models
Installation
To use AWS Nova Sonic services, install the required dependencies:Prerequisites
AWS Account Setup
Before using AWS Nova Sonic services, you need:- AWS Account: Set up at AWS Console
- Bedrock Access: Enable AWS Bedrock service in your region
- Model Access: Request access to Nova Sonic models in Bedrock
- IAM Credentials: Configure AWS access keys with Bedrock permissions
Required Environment Variables
AWS_SECRET_ACCESS_KEY
: Your AWS secret access keyAWS_ACCESS_KEY_ID
: Your AWS access key IDAWS_REGION
: AWS region where Bedrock is available
Key Features
- Real-time Speech-to-Speech: Direct audio input to audio output processing
- Built-in Transcription: Automatic speech-to-text with real-time streaming
- Voice Activity Detection: Automatic detection of speech start/stop
- Function Calling: Support for external function and API integration
- Multiple Voices: Choose from matthew, tiffany, and amy voice options