🎙️ VoiceFlow AI: Telegram + Deepgram + OpenAI + Supabase Audio Assistant

Go to Workflow
516 views
Built by RedOne RedOne
Created on June 06, 2026

Description

🎙️ AI Audio Assistant with Voice-to-Voice Response

Who is this for?
Businesses, customer service teams, content creators, and organizations who want to provide intelligent voice-based interactions through Telegram. Perfect for accessibility-focused services, multilingual support, or hands-free customer assistance.

What problem does this solve?
Enables natural voice conversations with AI
Breaks down language and accessibility barriers
Provides instant voice responses to customer queries
Reduces typing requirements for users
Offers 24/7 voice-based customer support
Maintains conversation context across voice interactions

What this workflow does:
Receives voice messages via Telegram bot
Transcribes audio using Deepgram's advanced speech-to-text
Processes transcribed text through AI agent with knowledge base access
Generates intelligent responses based on conversation context
Converts AI response to natural-sounding speech using Deepgram TTS
Sends audio response back to user via Telegram
Maintains conversation memory for contextual interactions

🔧 Technical Architecture

Core Components:
Telegram Bot**: Receives and sends voice messages
Deepgram STT**: Transcribes voice to text with high accuracy
OpenAI GPT**: Processes queries and generates responses
Supabase Knowledge Base**: Stores and retrieves business information
Memory Management**: Maintains conversation context
Deepgram TTS**: Converts text responses to natural speech

Data Flow:
Voice Message → Telegram API → File Download
Audio File → Deepgram STT → Transcript
Transcript → AI Agent → Response Generation
Response → Deepgram TTS → Audio File
Audio Response → Telegram → User

🛠️ Setup Instructions

Prerequisites
Telegram Bot Token
Create bot via @BotFather
Get bot token and configure webhook

Deepgram API Key
Sign up at deepgram.com
Get API key for STT and TTS services
Note: Currently hardcoded in workflow

OpenAI API Key
OpenAI account with API access
Configure in OpenAI Chat Model node

Supabase Database
Create Supabase project
Set up knowledge_base table
Configure API credentials

Step-by-Step Setup

Configure Telegram Bot
Update telegramToken in "Prepare Voice Message Data" node
Set correct bot token in Telegram nodes
Test bot connectivity

Set Up Deepgram Integration
Replace API key in "Transcribe with Deepgram" node
Update TTS endpoint in "HTTP Request" node
Test voice transcription accuracy

Configure Knowledge Base
-- Create knowledge_base table in Supabase
CREATE TABLE knowledge_base (
id UUID DEFAULT gen_random_uuid() PRIMARY KEY,
question TEXT NOT NULL,
answer TEXT NOT NULL,
category VARCHAR(100),
keywords TEXT[],
created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW()
);

Customize AI Prompts
Update system message in "Telegram AI Agent" node
Adjust temperature and max tokens in OpenAI model
Configure memory session keys

Test End-to-End Flow
Send test voice message to bot
Verify transcription accuracy
Check AI response quality
Validate audio output clarity

🎛️ Configuration Options

Voice Recognition Settings
Model**: nova-2 (Deepgram's latest model)
Language**: English (en) - can be changed
Smart Format**: Enabled for better punctuation

AI Response Settings
Temperature**: 0.3 (conservative responses)
Max Tokens**: 100 (adjust based on needs)
Memory**: Session-based conversation context

Text-to-Speech Settings
Model**: aura-2-thalia-en (natural female voice)
Alternative voices**: Available in Deepgram TTS API
Audio Format**: Optimized for Telegram

🔒 Security Considerations

API Key Management
// Current implementation has hardcoded tokens
// Recommended: Use environment variables
const telegramToken = process.env.TELEGRAM_BOT_TOKEN;
const deepgramKey = process.env.DEEPGRAM_API_KEY;

Data Privacy
Voice messages are processed by external APIs
Consider data retention policies
Implement user consent mechanisms
Ensure GDPR compliance if applicable

📊 Monitoring & Analytics

Key Metrics to Track
Voice message processing time
Transcription accuracy rates
AI response quality scores
User engagement metrics
Error rates and failure points

Recommended Logging
// Add to workflow for monitoring
console.log({
timestamp: new Date().toISOString(),
user_id: userData.user_id,
transcript_confidence: transcriptData.confidence,
response_length: aiResponse.length,
processing_time: processingTime
});

🚀 Customization Ideas

Enhanced Features
Multi-language Support
Add language detection
Support multiple TTS voices
Translate responses

Voice Commands
Implement wake words
Add voice shortcuts
Create voice menus

Advanced AI Features
Sentiment analysis
Intent classification
Escalation triggers

Integration Expansions
Connect to CRM systems
Add calendar scheduling
Integrate with help desk tools

Performance Optimizations
Implement audio preprocessing
Add response caching
Optimize API call sequences
Implement retry mechanisms

🐛 Troubleshooting

Common Issues

Voice Not Transcribing
Check Deepgram API key validity
Verify audio format compatibility
Test with shorter voice messages

Poor Audio Quality
Adjust TTS model settings
Check network connectivity
Verify Telegram audio limits

AI Responses Too Generic
Improve knowledge base content
Adjust system prompts
Increase context window

Memory Not Working
Check session key configuration
Verify user ID extraction
Test conversation continuity

💡 Best Practices

Voice Interface Design
Keep responses concise and clear
Use natural speech patterns
Avoid technical jargon
Provide clear next steps

Knowledge Base Management
Regular content updates
Clear categorization
Keyword optimization
Quality assurance testing

User Experience
Fast response times (<5 seconds)
Consistent voice personality
Graceful error handling
Clear capability communication

📈 Success Metrics

Technical KPIs
Response time: <3 seconds average
Transcription accuracy: >95%
User satisfaction: >4.5/5
Uptime: >99.5%

Business KPIs
Customer query resolution rate
Support ticket reduction
User engagement increase
Cost per interaction decrease

🔄 Maintenance Schedule

Daily
Monitor error logs
Check API rate limits
Verify service uptime

Weekly
Review conversation quality
Update knowledge base
Analyze usage patterns

Monthly
Performance optimization
Security audit
Feature updates
User feedback review

📚 Additional Resources

Documentation Links
Deepgram STT API
Deepgram TTS API
Telegram Bot API
OpenAI API
Supabase Documentation

Community Support
n8n Community Forum
Telegram Bot Developers Group
Deepgram Developer Discord
OpenAI Developer Community

Note: This template requires active API subscriptions for Deepgram and OpenAI services. Costs may apply based on usage volume.

Nodes Used (5)

AI Agent
@n8n/n8n-nodes-langchain.agent
HTTP Request
n8n-nodes-base.httpRequest
OpenAI Chat Model
@n8n/n8n-nodes-langchain.lmChatOpenAi
Simple Memory
@n8n/n8n-nodes-langchain.memoryBufferWindow
Telegram
n8n-nodes-base.telegram