RAG-Powered AI Voice Customer Support Agent (Supabase + Gemini + ElevenLabs)

0 views

Built by

iamvaar

Created on June 13, 2026

Description

Execution video: Youtube Link

I built an AI voice-triggered RAG assistant where ElevenLabs’ conversational model acts as the front end and n8n handles the brain....here’s the real breakdown of what’s happening in that workflow:

Webhook (/inf)

Gets hit by ElevenLabs once the user finishes talking.
Payload includes user_question.

Embed User Message (Together API - BAAI/bge-large-en-v1.5)

Turns the spoken question into a dense vector embedding.
This embedding is the query representation for semantic search.

Search Embeddings (Supabase RPC)

Calls matchembeddings1 to find the top 5 most relevant context chunks from your stored knowledge base.

Aggregate

Merges all retrieved chunk values into one block of text so the LLM gets full context at once.

Basic LLM Chain (LangChain node)

Prompt forces the model to only answer from the retrieved context and to sound human-like without saying “based on the context”....
Uses Google Vertex Gemini 2.5 Flash as the actual model.

Respond to Webhook

Sends the generated answer back instantly to the webhook call, so ElevenLabs can speak it back.

You essentially have:
Voice → Text → Embedding → Vector Search → Context Injection → LLM → Response → Voice

Nodes Used (6)

Basic LLM Chain

@n8n/n8n-nodes-langchain.chainLlm

Code

n8n-nodes-base.code

Google Docs

n8n-nodes-base.googleDocs

Google Vertex Chat Model

@n8n/n8n-nodes-langchain.lmChatGoogleVertex

HTTP Request

n8n-nodes-base.httpRequest

Supabase

n8n-nodes-base.supabase

RAG-Powered AI Voice Customer Support Agent (Supabase + Gemini + ElevenLabs)

Description

Nodes Used (6)

Select Nodes to Filter