Translate English Scripts to Multilingual Audio with GPT-4 and ElevenLabs
Go to WorkflowDescription
π Description
Automate multilingual voice content creation by translating an English script into multiple languages and generating natural-sounding audio files using AI. πποΈ This workflow receives a script via webhook, translates it into selected target languages using GPT-4, converts each translation into high-quality speech with ElevenLabs, and uploads the resulting MP3 files to Google Drive. Itβs ideal for teams producing localized videos, podcasts, courses, or educational content at scale β without manual translation or recording. πβ¨
π What This Template Does
1οΈβ£ Receives an English script and target languages via webhook. π
2οΈβ£ Extracts and normalizes request parameters for processing. π
3οΈβ£ Uses GPT-4 with a strict JSON schema to translate the script into multiple languages. π€π
4οΈβ£ Splits each translated language into individual processing items. π
5οΈβ£ Converts each translation into natural-sounding speech using ElevenLabs. ποΈ
6οΈβ£ Processes translations sequentially to avoid API limits. β³
7οΈβ£ Uploads language-specific MP3 files to Google Drive with clean naming. βοΈ
8οΈβ£ Returns generated audio as a webhook response for frontend usage. π
9οΈβ£ Triggers Slack alerts if any workflow error occurs. π¨
β Key Benefits
β
Instantly localizes audio content into multiple languages
β
Produces human-quality voiceovers without recording studios
β
Ensures consistent tone and meaning across all translations
β
Fully automated from script input to audio delivery
β
Scales easily for videos, podcasts, training, and education
β
Clean Google Drive storage for easy access and reuse
π§© Features
Webhook-based script ingestion
GPT-4 multilingual translation with structured JSON output
AI memory for consistent translation behavior
ElevenLabs multilingual text-to-speech generation
Batch processing for multiple languages
Google Drive MP3 upload with language-based filenames
Webhook audio response for frontend apps
Slack-based error handling and alerts
π Requirements
OpenAI API key (GPT-4 / GPT-4.1-mini)
ElevenLabs API key (multilingual voice model)
Google Drive OAuth2 credentials
Slack API credentials (for error alerts)
Webhook client sending Script, first_language, and second_language
π― Target Audience
Content teams creating localized videos or podcasts
EdTech platforms producing multilingual learning material
Marketing teams localizing campaigns across regions
Creators building AI-driven voice pipelines
Automation teams integrating audio localization into products