Overview

OpenAISTTService provides speech recognition using OpenAI’s transcription models, including the GPT-4o transcription model and the Whisper API. It relies on Voice Activity Detection (VAD) to identify speech segments, so that detected speech, rather than silence, is what gets sent for transcription.

Installation

To use OpenAI services, install the required dependency:
pip install "pipecat-ai[openai]"
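
After installing, it can be useful to confirm that the packages are actually visible to the interpreter you plan to run. The following is a minimal sketch, not part of Pipecat itself: installed_version is a hypothetical helper, and the assumption that the extra pulls in a distribution named "openai" should be checked against your installed version.

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "pipecat-ai" is the distribution named in the pip command above;
    # "openai" is assumed to be the dependency the extra installs.
    for name in ("pipecat-ai", "openai"):
        version = installed_version(name)
        print(f"{name}: {version or 'not installed'}")
```

If either line reports "not installed", re-run the pip command in the same virtual environment that runs your application.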

Prerequisites

OpenAI Account Setup

Before using OpenAI STT services, you need:
  1. OpenAI Account: Sign up on the OpenAI Platform
  2. API Key: Generate an API key from your account dashboard
  3. Model Access: Confirm that your account has access to the Whisper and GPT-4o transcription models

Required Environment Variables

  • OPENAI_API_KEY: Your OpenAI API key for authentication
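
One way to read this variable at startup is sketched below. It fails early with a clear message when the key is missing, instead of surfacing an authentication error on the first transcription request. The helper name load_openai_api_key and its error text are illustrative assumptions, not part of Pipecat.

```python
import os


def load_openai_api_key(env=None):
    """Return the OpenAI API key from the environment, or raise if it is missing."""
    # Accept an explicit mapping so the function can be exercised without
    # touching the real process environment.
    env = os.environ if env is None else env
    key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "OPENAI_API_KEY is not set; export it before starting the application."
        )
    return key
```

The returned value would then typically be handed to the service when it is constructed, assuming its constructor accepts an api_key argument.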