RAG-Powered AI Voice Customer Support Agent (Supabase + Gemini + ElevenLabs)

Go to Workflow
0 views
Built by iamvaar iamvaar
Created on June 13, 2026

Description

Execution video: Youtube Link

I built an AI voice-triggered RAG assistant where ElevenLabs’ conversational model acts as the front end and n8n handles the brain....here’s the real breakdown of what’s happening in that workflow:

Webhook (/inf)

Gets hit by ElevenLabs once the user finishes talking.
Payload includes user_question.

Embed User Message (Together API - BAAI/bge-large-en-v1.5)

Turns the spoken question into a dense vector embedding.
This embedding is the query representation for semantic search.

Search Embeddings (Supabase RPC)

Calls matchembeddings1 to find the top 5 most relevant context chunks from your stored knowledge base.

Aggregate

Merges all retrieved chunk values into one block of text so the LLM gets full context at once.

Basic LLM Chain (LangChain node)

Prompt forces the model to only answer from the retrieved context and to sound human-like without saying “based on the context”....
Uses Google Vertex Gemini 2.5 Flash as the actual model.

Respond to Webhook

Sends the generated answer back instantly to the webhook call, so ElevenLabs can speak it back.

You essentially have:
Voice → Text → Embedding → Vector Search → Context Injection → LLM → Response → Voice

Nodes Used (6)

Basic LLM Chain
@n8n/n8n-nodes-langchain.chainLlm
Code
n8n-nodes-base.code
Google Docs
n8n-nodes-base.googleDocs
Google Vertex Chat Model
@n8n/n8n-nodes-langchain.lmChatGoogleVertex
HTTP Request
n8n-nodes-base.httpRequest
Supabase
n8n-nodes-base.supabase