strands-deepgram¶
Community Contribution
This is a community-maintained package that is not owned or supported by the Strands team. Validate and review the package before using it in your project.
Have your own integration? We'd love to add it here too!
strands-deepgram is a production-ready speech and audio processing tool powered by Deepgram's AI platform with 30+ language support.
Installation¶
pip install strands-deepgram
Usage¶
from strands import Agent
from strands_deepgram import deepgram
agent = Agent(tools=[deepgram])
# Transcribe with speaker identification
agent("transcribe this audio: recording.mp3 with speaker diarization")
# Text-to-speech
agent("convert this text to speech: Hello world")
# Audio intelligence
agent("analyze sentiment in call.wav")
Key Features¶
- Speech-to-Text: 30+ language support and speaker diarization
- Text-to-Speech: Natural-sounding voices (Aura series)
- Audio Intelligence: Sentiment analysis, topic detection, and intent recognition
- Speaker Diarization: Identify and separate different speakers
- Multi-format Support: WAV, MP3, M4A, FLAC, and more
- Real-time Processing: Streaming capabilities for live audio
Configuration¶
DEEPGRAM_API_KEY=your_deepgram_api_key # Required
DEEPGRAM_DEFAULT_MODEL=nova-3 # Optional
DEEPGRAM_DEFAULT_LANGUAGE=en # Optional
Get your API key at: console.deepgram.com