Multilingual Voice & Text Telegram Bot with ElevenLabs TTS and LangChain Agents
Go to WorkflowDescription
This workflow contains community nodes that are only compatible with the self-hosted version of n8n.
Telegram Voice AI Assistant
This n8n template creates a multimodal Telegram bot that dynamically responds to users:
Replies with voice** when receiving voice messages (using ElevenLabs TTS)
Replies with text** for text-based queries
Supports custom AI tools (e.g., crypto APIs, databases, or custom functions)
Built with LangChain Agents, it can integrate any external API or data source into conversations.
Key Features
🎙️ Smart Response Logic
Voice Query? → Voice Reply**
Transcribes audio via ElevenLabs STT
Processes with AI (Groq/Gemini)
Converts text response to natural speech (ElevenLabs TTS)
Text Query? → Text Reply**
Bypasses TTS/STT for faster responses
🛠️ Extensible AI Tools
Add your own tools:
Database lookups
Weather/stock APIs
Custom Python functions
RAG (document retrieval)
Supports multi-step tool chaining (e.g., "Get BTC price → analyze trends → summarize")
🌐 Language & Context
Auto-detects user language (via Telegram’s language_code)
Maintains session memory (remembers conversation history)
Use Cases
Voice-first customer support**
Crypto/analytics assistants** (e.g., "What’s Ethereum’s current gas fee?")
Multilingual FAQ bots**
Educational tutors** (voice-interactive learning)
Requirements
Telegram Bot Token**
ElevenLabs API Key** (For TTS/STT)
Groq API Key* or *Google Gemini API Key**
Customization Tips
Change AI personality*: Modify the *systemMessage in the Voice Assistant node
Add more models**: Swap Groq/Gemini for OpenAI, Anthropic, etc.
Extend functionality**: Add RAG (Retrieval-Augmented Generation) for document queries
Take this template to create a Siri-like AI assistant for Telegram in minutes! 🚀