Ebook to Audiobook converter using MiniMax and FFmpeg

Built by Jay Emp0
Created on June 05, 2026

What It Does

Turn a text-based PDF ebook into an MP3 audiobook automatically. Upload a PDF through the web form and the finished audiobook is saved to your Google Drive. Useful for listening to books, research papers, or documents on the go.

Example: Input PDF → Output Audiobook

Key Features

Upload PDF via web form → Get MP3 audiobook in Google Drive
Natural-sounding AI voices (MiniMax Speech-02-HD)
Automatic text extraction, chunking, and audio merging
Customizable voice, speed, and emotion settings
Processes long books in batches with smart rate limiting

Perfect For

Students: Turn textbooks into study audiobooks
Professionals: Listen to reports and documents while commuting
Content Creators: Repurpose written content as audio
Accessibility: Make content accessible to visually impaired users

Requirements

| Component | Details |
|-----------|---------|
| n8n | Self-hosted ONLY (cannot run on n8n Cloud) |
| FFmpeg | Must be installed in your n8n environment |
| Replicate API | For MiniMax TTS (sign up at replicate.com) |
| Google Drive | OAuth2 credentials + "Audiobook" folder |

⚠️ Important: This workflow does NOT work on n8n Cloud because FFmpeg installation is required.

Quick Setup

1. Install FFmpeg

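Docker users (run as root inside the container; packages installed this way are lost when the container is recreated, so consider adding FFmpeg to a custom image):
docker exec -it -u root <n8n-container-name> /bin/bash
apt-get update && apt-get install -y ffmpeg
On Alpine-based images, which ship without apt-get and often without bash, open /bin/sh instead and run apk add --no-cache ffmpeg.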

Native installation:
sudo apt-get update && sudo apt-get install -y ffmpeg # Linux (Debian/Ubuntu)
brew install ffmpeg # macOS
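
To confirm the installation from the same environment that runs n8n, a small check like the following can help. It is a minimal sketch that only looks for a binary named `ffmpeg` on `PATH` and reads its version banner; the function names are hypothetical and not part of the workflow.

```python
import shutil
import subprocess
from typing import Optional


def find_ffmpeg(binary: str = "ffmpeg") -> Optional[str]:
    """Return the full path of the FFmpeg binary, or None if it is not on PATH."""
    return shutil.which(binary)


def ffmpeg_version(binary: str = "ffmpeg") -> Optional[str]:
    """Return the first line of `ffmpeg -version`, or None if FFmpeg is missing."""
    path = find_ffmpeg(binary)
    if path is None:
        return None
    result = subprocess.run(
        [path, "-version"], capture_output=True, text=True, check=False
    )
    lines = result.stdout.splitlines()
    return lines[0] if lines else None


if __name__ == "__main__":
    print(ffmpeg_version() or "FFmpeg not found on PATH")
```

If this prints "FFmpeg not found on PATH" inside the n8n container or host, the merge step is likely to fail as well.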

2. Get API Keys
Replicate: Sign up at replicate.com and copy your API token
Google Drive: Set up OAuth2 in n8n and create an "Audiobook" folder in Drive

3. Import & Configure
Import n8n.json into your n8n instance
Replace the Replicate API token in the "MINIMAX TTS" node
Configure Google Drive credentials and select your "Audiobook" folder
Activate the workflow

Cost Estimate

| Item | Estimate |
|------|----------|
| MiniMax TTS API | $0.15 per 1,000 characters ($3-5 for an average book) |
| Google Drive Storage | Free (up to 15 GB) |
| Processing Time | ~1-2 minutes per 10 pages |
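
The per-character rate makes it easy to estimate a bill before converting a book. The sketch below uses the $0.15 per 1,000 characters figure from the table as its default; that rate may not match current Replicate pricing, and the function names are hypothetical. At this rate a $3-5 budget covers roughly 20,000 to 33,000 characters, so it is worth running the numbers for a full-length book.

```python
RATE_PER_1000_CHARS = 0.15  # USD, taken from the table above; verify against current pricing


def estimate_cost(char_count: int, rate_per_1000: float = RATE_PER_1000_CHARS) -> float:
    """Estimate the TTS cost in USD for a given number of characters."""
    if char_count < 0:
        raise ValueError("char_count must be non-negative")
    return round(char_count / 1000 * rate_per_1000, 2)


def chars_for_budget(budget_usd: float, rate_per_1000: float = RATE_PER_1000_CHARS) -> int:
    """Return how many characters a budget covers at the given rate."""
    if rate_per_1000 <= 0:
        raise ValueError("rate_per_1000 must be positive")
    return int(budget_usd / rate_per_1000 * 1000)


if __name__ == "__main__":
    print(estimate_cost(20_000))   # cost of 20,000 characters
    print(chars_for_budget(5.0))   # characters covered by a $5 budget
```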

How It Works



PDF Upload → Extract Text → Split into Chunks → Convert to Speech (batches of 5)
→ Merge Audio Files (FFmpeg) → Upload to Google Drive
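
The "Split into Chunks" and "batches of 5" stages of this pipeline can be sketched as follows. This is an illustration, not the workflow's actual Code node: the 1,000-character chunk limit, the sentence-boundary splitting rule, and the function names are all assumptions.

```python
import re
from typing import Iterator, List

MAX_CHARS = 1000  # hypothetical chunk size; the workflow's real limit is not stated
BATCH_SIZE = 5    # matches "batches of 5" in the pipeline above


def split_into_chunks(text: str, max_chars: int = MAX_CHARS) -> List[str]:
    """Group sentences into chunks of at most max_chars characters.

    A single sentence longer than max_chars is hard-split at the limit.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        # Hard-split oversized sentences so no chunk exceeds the limit.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


def batched(items: List[str], size: int = BATCH_SIZE) -> Iterator[List[str]]:
    """Yield consecutive groups of at most `size` items, preserving order."""
    for start in range(0, len(items), size):
        yield items[start:start + size]
```

Splitting on sentence boundaries keeps each TTS request from cutting a sentence in half, which tends to produce more natural pauses when the audio files are joined.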

The workflow uses four main modules:
Extraction: PDF text extraction and intelligent chunking
Conversion: MiniMax TTS processes text in batches
Merging: FFmpeg combines all audio files seamlessly
Upload: Final audiobook saved to Google Drive
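
For the merging module, one common way to join MP3 files with FFmpeg is the concat demuxer, sketched below. The source does not show the exact FFmpeg command the workflow runs, so treat this as one possible approach; the helper names and the `concat_list.txt` file name are hypothetical.

```python
import subprocess
from pathlib import Path
from typing import List, Sequence


def build_concat_list(audio_files: Sequence[str]) -> str:
    """Return the contents of an FFmpeg concat-demuxer list file."""
    lines = []
    for name in audio_files:
        # Inside single quotes, a literal quote is written as '\'' for FFmpeg.
        escaped = name.replace("'", "'\\''")
        lines.append(f"file '{escaped}'")
    return "\n".join(lines) + "\n"


def build_merge_command(list_path: str, output_path: str) -> List[str]:
    """Build the FFmpeg command that joins the listed files without re-encoding."""
    return [
        "ffmpeg", "-y",      # overwrite the output file if it exists
        "-f", "concat",      # use the concat demuxer
        "-safe", "0",        # allow absolute paths in the list file
        "-i", list_path,
        "-c", "copy",        # copy the audio stream as-is
        output_path,
    ]


def merge_audio(audio_files: Sequence[str], output_path: str, workdir: str = ".") -> None:
    """Write the list file and run FFmpeg; raises CalledProcessError on failure."""
    list_path = Path(workdir) / "concat_list.txt"
    list_path.write_text(build_concat_list(audio_files), encoding="utf-8")
    subprocess.run(build_merge_command(str(list_path), output_path), check=True)
```

Stream copy (`-c copy`) avoids re-encoding, which is fast and lossless when every chunk was generated with the same codec settings. Absolute paths in the list file avoid surprises, since relative entries are resolved against the list file's location.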

Voice Settings (Customizable)

{
  "voice_id": "Friendly_Person",
  "emotion": "happy",
  "speed": 1,
  "pitch": 0
}

Available emotions: happy, neutral, sad, angry, excited
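
To show where these settings end up, here is a sketch of the HTTP request the "MINIMAX TTS" node might send. Several details are assumptions rather than facts from the workflow: the model slug `minimax/speech-02-hd`, the use of Replicate's model-predictions endpoint, and the `text` input field name. The code only builds the request and does not send it.

```python
import json
import urllib.request
from typing import Any, Dict, Optional

# Assumed endpoint and model slug; check the workflow's HTTP Request node for the real URL.
API_URL = "https://api.replicate.com/v1/models/minimax/speech-02-hd/predictions"

# Voice settings copied from the section above.
VOICE_SETTINGS: Dict[str, Any] = {
    "voice_id": "Friendly_Person",
    "emotion": "happy",
    "speed": 1,
    "pitch": 0,
}


def build_tts_request(
    text: str,
    api_token: str,
    settings: Optional[Dict[str, Any]] = None,
) -> urllib.request.Request:
    """Build (but do not send) a POST request for one text chunk."""
    merged = dict(VOICE_SETTINGS)
    merged.update(settings or {})
    # "text" is an assumed input name for the chunk to be spoken.
    payload = {"input": {"text": text, **merged}}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing `settings={"emotion": "neutral", "speed": 1.2}` overrides only those two fields and keeps the remaining defaults.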

Limitations

⚠️ Self-hosted n8n ONLY (not compatible with n8n Cloud)
PDF files only (not EPUB, MOBI, or scanned images)
Large books (500+ pages) take longer to process
Requires FFmpeg installation (see setup above)

Troubleshooting

FFmpeg not found?
Docker: Run docker exec -it <container> /bin/bash, then apt-get update && apt-get install -y ffmpeg
Native: Run sudo apt-get install -y ffmpeg (Debian/Ubuntu) or brew install ffmpeg (macOS)

Rate limit errors?
Increase wait time in the "WAITS FOR 5 SECONDS" node to 10-15 seconds
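
The effect of that wait node can be pictured as a pause between batches. The sketch below is not the workflow's implementation: `process_in_batches` and its parameters are hypothetical, and `convert` stands in for whatever sends a chunk to the TTS API.

```python
import time
from typing import Callable, Iterable, List, Sequence


def process_in_batches(
    batches: Iterable[Sequence[str]],
    convert: Callable[[str], str],
    wait_seconds: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
) -> List[str]:
    """Convert every chunk, pausing wait_seconds between consecutive batches."""
    batch_list = list(batches)
    results: List[str] = []
    for index, batch in enumerate(batch_list):
        results.extend(convert(chunk) for chunk in batch)
        # No pause is needed after the final batch.
        if index < len(batch_list) - 1:
            sleep(wait_seconds)
    return results
```

Raising `wait_seconds` from 5 to 10 or 15 lowers the request rate at the cost of a longer total run, which mirrors the adjustment suggested above.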

Google Drive upload fails?
Make sure you created the "Audiobook" folder in your Google Drive
Reconfigure OAuth2 credentials in n8n

Created by emp0 | More workflows: n8n Gallery

Nodes Used (3)

Code (n8n-nodes-base.code)
Google Drive (n8n-nodes-base.googleDrive)
HTTP Request (n8n-nodes-base.httpRequest)