Skip to content

strands-deepgram

Community Contribution

This is a community-maintained package that is not owned or supported by the Strands team. Validate and review the package before using it in your project.

Have your own integration? We'd love to add it here too!

strands-deepgram is a production-ready speech and audio processing tool powered by Deepgram's AI platform with 30+ language support.

Installation

pip install strands-deepgram

Usage

from strands import Agent
from strands_deepgram import deepgram

agent = Agent(tools=[deepgram])

# Transcribe with speaker identification
agent("transcribe this audio: recording.mp3 with speaker diarization")

# Text-to-speech
agent("convert this text to speech: Hello world")

# Audio intelligence
agent("analyze sentiment in call.wav")

Key Features

  • Speech-to-Text: 30+ language support and speaker diarization
  • Text-to-Speech: Natural-sounding voices (Aura series)
  • Audio Intelligence: Sentiment analysis, topic detection, and intent recognition
  • Speaker Diarization: Identify and separate different speakers
  • Multi-format Support: WAV, MP3, M4A, FLAC, and more
  • Real-time Processing: Streaming capabilities for live audio

Configuration

DEEPGRAM_API_KEY=your_deepgram_api_key    # Required
DEEPGRAM_DEFAULT_MODEL=nova-3             # Optional
DEEPGRAM_DEFAULT_LANGUAGE=en              # Optional

Get your API key at: console.deepgram.com

Resources