Overview

AWSNovaSonicLLMService enables natural, real-time conversations with AWS Nova Sonic. It provides built-in audio transcription, voice activity detection, and context management for creating interactive AI experiences with bidirectional audio streaming, text generation, and function calling capabilities.

Installation

To use AWS Nova Sonic services, install the required dependencies:
pip install "pipecat-ai[aws-nova-sonic]"

Prerequisites

AWS Account Setup

Before using AWS Nova Sonic services, you need:
  1. AWS Account: Set up at AWS Console
  2. Bedrock Access: Enable AWS Bedrock service in your region
  3. Model Access: Request access to Nova Sonic models in Bedrock
  4. IAM Credentials: Configure AWS access keys with Bedrock permissions

Required Environment Variables

  • AWS_SECRET_ACCESS_KEY: Your AWS secret access key
  • AWS_ACCESS_KEY_ID: Your AWS access key ID
  • AWS_REGION: AWS region where Bedrock is available

Key Features

  • Real-time Speech-to-Speech: Direct audio input to audio output processing
  • Built-in Transcription: Automatic speech-to-text with real-time streaming
  • Voice Activity Detection: Automatic detection of speech start/stop
  • Function Calling: Support for external function and API integration
  • Multiple Voices: Choose from matthew, tiffany, and amy voice options