Generate Videos from Text Prompts using GPT-5 and Google Veo-3

Built by AI/ML API | D1m7asis
Created on June 07, 2026

Description

This workflow contains community nodes that are only compatible with the self-hosted version of n8n.

🎬 GPT-5 Cinematic Video Generator (n8n + AI/ML API + Google Veo-3)
This n8n workflow turns a single word or short phrase into a rendered cinematic video. GPT-5 expands the input into a detailed cinematic prompt, and Google Veo-3 image-to-video animates a still image according to that prompt. It suits rapid creative prototyping, content creation, and AI-driven video production.

🚀 Key Features

Ultra-Short Input Ready – Works with just one keyword or a short idea.
AI Prompt Expansion – GPT-5 adds cinematic elements: camera motion, mood, color palette, and composition.
Automated Video Generation – The Veo-3 i2v (image-to-video) model creates smooth, visually appealing clips.
Smart Polling – Waits until the video is ready, with success/fail handling.
Direct Output – Returns a ready-to-use video URL for download or publishing.

🛠 Setup Guide

Create AI/ML API Credentials

Get your API key from the AI/ML API Keys page.
In n8n > Credentials, add an AI/ML account credential (Bearer token) using that key.
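
For readers who want to see what a Bearer-token credential amounts to outside n8n, here is a minimal Python sketch of the request headers such a credential would typically produce. The function name `bearer_headers` is hypothetical and not part of the workflow. The `Content-Type` value is an assumption about the API expecting JSON.

```python
def bearer_headers(api_key: str) -> dict:
    """Build HTTP headers for a Bearer-token authenticated JSON request.

    Hypothetical helper for illustration; n8n builds the equivalent
    Authorization header itself once the credential is configured.
    """
    key = api_key.strip()
    if not key:
        raise ValueError("API key is empty")
    return {
        # Standard Bearer scheme: "Authorization: Bearer <token>"
        "Authorization": f"Bearer {key}",
        # Assumption: the API accepts JSON request bodies.
        "Content-Type": "application/json",
    }
```

In n8n you never write this by hand. The sketch only shows that the credential boils down to a single `Authorization` header attached to each HTTP Request node call.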

(Optional) Change Image Source

Edit the Set image URL node to point to your desired still image for video generation.
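
As a rough illustration of how the image URL travels with the prompt, the sketch below validates a URL and assembles a request body. All field names (`model`, `prompt`, `image_url`) and the default model identifier are hypothetical placeholders, not confirmed AI/ML API parameters. Check the provider's documentation for the real schema.

```python
from urllib.parse import urlparse


def build_video_request(prompt: str, image_url: str,
                        model: str = "veo-3-i2v-placeholder") -> dict:
    """Assemble an image-to-video request body.

    Field names and the default model id are hypothetical placeholders.
    """
    parsed = urlparse(image_url)
    # Require an absolute http(s) URL so the video service can fetch the image.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"image_url must be an absolute http(s) URL: {image_url!r}")
    if not prompt.strip():
        raise ValueError("prompt is empty")
    return {
        "model": model,
        "prompt": prompt.strip(),
        "image_url": image_url,
    }
```

Validating the URL up front is a design choice: a malformed image source fails immediately instead of after a generation job has been queued.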

Run from Chat

Send a word or short phrase via Chat Trigger to start the process.

(Optional) Extend Output

Add Google Drive upload, YouTube publish, or database logging nodes for a complete content pipeline.

💡 How It Works

Trigger – Receives chat input (keyword or brief).
Prompt Expansion – GPT-5 turns it into a cinematic, Veo-3-optimized description.
Video Creation – Sends the prompt and image to Veo-3 image-to-video.
Polling – Checks generation status every 30 seconds until the job completes or fails.
Result – Returns the direct video URL, ready for your next steps.
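
The polling step above can be sketched in Python as follows. This is an illustration under stated assumptions, not the workflow's actual node logic. The status strings (`"completed"`, `"failed"`), the `video_url` key, and the `max_attempts` cap are hypothetical. Only the 30-second interval comes from the description. The status fetcher is passed in as a callable, so the loop does not depend on any particular endpoint.

```python
import time
from typing import Callable


def poll_until_done(
    fetch_status: Callable[[], dict],
    interval_seconds: float = 30.0,   # interval stated in the workflow description
    max_attempts: int = 40,           # assumption: cap so the loop cannot run forever
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll a generation job until it succeeds, fails, or times out.

    Returns the video URL on success. Status values and response keys
    are hypothetical placeholders.
    """
    for attempt in range(max_attempts):
        result = fetch_status()
        status = result.get("status")
        if status == "completed":
            return result["video_url"]
        if status == "failed":
            raise RuntimeError(f"video generation failed: {result}")
        # Any other status is treated as still in progress; wait and retry,
        # but skip the wait after the final attempt.
        if attempt < max_attempts - 1:
            sleep(interval_seconds)
    raise TimeoutError(f"job not finished after {max_attempts} status checks")
```

A caller would supply a `fetch_status` function that performs the real HTTP status request. Injecting `sleep` lets the loop be exercised in tests without waiting.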

Nodes Used (1)

HTTP Request
n8n-nodes-base.httpRequest