Overview
GoogleSTTService provides real-time speech recognition using Google Cloud’s Speech-to-Text V2 API. It supports 125+ languages, multiple models, voice activity detection, and advanced features such as automatic punctuation and word-level confidence scores.
- Google STT API Reference: Pipecat’s API methods for Google Cloud STT integration
- Example Implementation: Complete example with Google Cloud services
- Google Cloud Documentation: Official Google Cloud Speech-to-Text documentation
- Google Cloud Console: Create service accounts and manage API access
Installation
To use Google Cloud Speech services, install the required dependency:
pip install "pipecat-ai[google]"
Prerequisites
Google Cloud Setup
Before using Google Cloud STT services, you need:
- Google Cloud Account: Sign up at Google Cloud Console
- Project Setup: Create a project and enable the Speech-to-Text API
- Service Account: Create a service account with Speech-to-Text permissions
- Authentication: Set up credentials via service account key or Application Default Credentials
Required Environment Variables
- GOOGLE_APPLICATION_CREDENTIALS: Path to your service account key file (recommended)
- Or use Application Default Credentials for cloud deployments
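The two credential options above can be checked at startup before any audio is sent. The following is a minimal sketch using only the Python standard library. The helper name resolve_google_credentials is hypothetical and not part of Pipecat or the Google client libraries. It assumes that an unset variable means you intend to rely on Application Default Credentials.

```python
import json
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_google_credentials(
    env: Optional[Mapping[str, str]] = None,
) -> Optional[Path]:
    """Return the key file path from GOOGLE_APPLICATION_CREDENTIALS.

    Returns None when the variable is unset or empty, which this sketch
    treats as "fall back to Application Default Credentials".
    (Hypothetical helper, for illustration only.)
    """
    env = os.environ if env is None else env
    raw = env.get("GOOGLE_APPLICATION_CREDENTIALS", "").strip()
    if not raw:
        return None  # nothing configured: rely on Application Default Credentials

    path = Path(raw).expanduser()
    if not path.is_file():
        raise FileNotFoundError(
            f"GOOGLE_APPLICATION_CREDENTIALS points to a missing file: {path}"
        )

    # A credentials file is a JSON object; fail early if it is not parseable.
    with path.open(encoding="utf-8") as handle:
        info = json.load(handle)
    if not isinstance(info, dict):
        raise ValueError(f"Credentials file is not a JSON object: {path}")

    return path


if __name__ == "__main__":
    key_path = resolve_google_credentials()
    if key_path is None:
        print("No key file configured; expecting Application Default Credentials")
    else:
        print(f"Using service account key file: {key_path}")
```

Failing early on a missing or malformed key file tends to give a clearer error than a failure that surfaces later, once the speech stream is opened.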