Overview

GoogleSTTService provides real-time speech recognition through Google Cloud’s Speech-to-Text V2 API. It supports 125+ languages and multiple recognition models, along with voice activity detection, automatic punctuation, and word-level confidence scores.

Installation

To use Google Cloud Speech services, install the required dependency:
pip install "pipecat-ai[google]"

Prerequisites

Google Cloud Setup

Before using Google Cloud STT services, you need:
  1. Google Cloud Account: Sign up at Google Cloud Console
  2. Project Setup: Create a project and enable the Speech-to-Text API
  3. Service Account: Create a service account with Speech-to-Text permissions
  4. Authentication: Set up credentials via service account key or Application Default Credentials

Required Environment Variables

  • GOOGLE_APPLICATION_CREDENTIALS: Path to your service account key file (recommended)
  • Alternatively, leave the variable unset and rely on Application Default Credentials for cloud deployments
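
As a rough illustration of the two credential options above, the following sketch checks at startup which one is in effect. It uses only the Python standard library. The function name `describe_google_credentials` and its return values are hypothetical, made up for this example; they are not part of Pipecat or any Google client library. The sketch assumes the key file is a JSON document with a top-level "type" field, as service account key files downloaded from Google Cloud are.

```python
import json
import os
from pathlib import Path


def describe_google_credentials(env=None):
    """Report which credential source is configured (hypothetical helper).

    Returns ("key_file", <type field of the JSON file>) when
    GOOGLE_APPLICATION_CREDENTIALS points to a readable JSON file, or
    ("adc_fallback", None) when the variable is unset or empty.
    """
    env = os.environ if env is None else env
    key_path = env.get("GOOGLE_APPLICATION_CREDENTIALS")

    if not key_path:
        # No key file configured: authentication would have to come from
        # another Application Default Credentials source.
        return ("adc_fallback", None)

    path = Path(key_path)
    if not path.is_file():
        raise FileNotFoundError(
            f"GOOGLE_APPLICATION_CREDENTIALS points to a missing file: {key_path}"
        )

    # Key files are JSON; service account keys carry "type": "service_account".
    with path.open(encoding="utf-8") as fh:
        data = json.load(fh)

    return ("key_file", data.get("type"))


if __name__ == "__main__":
    source, kind = describe_google_credentials()
    print(f"credential source: {source}, type: {kind}")
```

Running a check like this before constructing the service can surface a missing or mistyped key path early, rather than as an authentication error on the first recognition request. It does not validate the key itself or its permissions.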