TheDirector
THE DIRECTOR: AI Image, Script & Video Generation Tool for Creators
v2.0 - April 16, 2025
Built for influencers, storytellers, educators, and visual content pros.
Turn ideas into cinematic short scenes, complete with images, scripts, voiceovers, and sound effects, all in one seamless AI pipeline.
Walkthrough video: https://youtu.be/L7SYD_pbraA
What's New in v2.0
• MMAudio Support: auto-generates matching sound/music for each scene
• Faster Rendering: scenes now complete in under 4 minutes on Ultra Pro
• Start from Text or Image: describe your character or provide a visual
• Improved Scene Generation: better detail, consistency, and flow
• Modified Interface: cleaner, more intuitive node layout
• Seamless integration with ComfyUI
• Complete 3-scene video with audio in under 17 minutes
Version History
v1.0 - March 27, 2025
• Initial Release: basic scene generation and rendering engine
STEP-BY-STEP WORKFLOW GUIDE
Step 1: Get a Gemini API Key
• Go to: https://aistudio.google.com/apikey
• Log in with your Google account
• Click "Create API Key"
• Copy it into the purple GEMINI API KEY node in ComfyUI
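Before pasting the key into ComfyUI, it can help to confirm that the key is accepted. The following is a minimal sketch, not part of The Director itself: it assumes curl is installed and that you keep the key in a shell variable called GEMINI_API_KEY (a name chosen here purely for illustration). The helper only builds the URL for the Gemini API's model-listing endpoint; the actual request is left as a comment so nothing is sent until you run it yourself.

```shell
# Build the URL for the Gemini API "list models" endpoint.
# GEMINI_API_KEY is a hypothetical variable name; the workflow does not read it.
gemini_models_url() {
  local key="$1"
  if [ -z "$key" ]; then
    echo "usage: gemini_models_url <api-key>" >&2
    return 1
  fi
  printf 'https://generativelanguage.googleapis.com/v1beta/models?key=%s\n' "$key"
}

# To check a key by hand (a working key returns a JSON list of models,
# a rejected key returns a JSON error object):
#   curl -s "$(gemini_models_url "$GEMINI_API_KEY")"
```

Keeping the key in a variable rather than typing it inline also keeps it out of screenshots of your terminal.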
Step 2: Choose a Reference Image
• Upload a visual reference OR
• Set Use Reference Image = False to start from a text description
• (Optional) Enter a project name for easy folder organization
Step 3: Enter a Story Prompt
• Be as detailed or brief as you like
• If you skipped the reference image, describe your character clearly
Step 4: Select Mode + Audio
• Choose: Portrait or Landscape
• Toggle MMAudio ON if you want auto-generated sound
Step 5: Click QUEUE
• Generation begins; each step takes ~32 seconds
• Increasing the number of steps improves quality (but takes longer)
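The trade-off between step count and waiting time can be estimated with simple arithmetic. This is a rough sketch under two assumptions: that "step" here means a sampling step, and that time grows linearly with the step count. The 32-second default is the figure quoted above; the function name is made up for this example.

```shell
# Rough render-time estimate, assuming time scales linearly with step count.
# 32 seconds per step is the figure quoted in this guide; your hardware may differ.
estimate_render_time() {
  local steps="$1"
  local secs_per_step="${2:-32}"
  local total=$(( steps * secs_per_step ))
  printf '%d steps: about %dm %02ds\n' "$steps" $(( total / 60 )) $(( total % 60 ))
}

estimate_render_time 8    # prints: 8 steps: about 4m 16s
estimate_render_time 16   # prints: 16 steps: about 8m 32s
```

Pass a second argument to try a different per-step time, for example `estimate_render_time 16 45` on slower hardware.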
Step 6: Select Your Scenes
• You'll see image batches (4 at a time)
• Pick your favorites and drag them into order (1-24)
• Hit Cancel + Retry if the results aren't what you want
Step 7: Generate Video & Stitching
• Images are rendered into scenes
• Selected scenes are stitched into a full video (with audio, if enabled)
Step 8: Retrieve Your Final Video
• Check the output folder
• Look for the .mp4 file (with "audio" in the name if you chose MMAudio)
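If the output folder fills up, a small helper can surface the newest video. This is a sketch only: it assumes the output folder is ComfyUI/output (the usual ComfyUI default; adjust the path if your install differs), and the function name is made up for this example.

```shell
# List .mp4 files under an output folder, newest first.
# Pass "audio" as the second argument to keep only files with "audio" in the name.
list_videos() {
  local dir="${1:-ComfyUI/output}"   # assumed default; adjust to your install
  local filter="${2:-}"
  # ls -t sorts by modification time; fine for file names without newlines.
  find "$dir" -type f -iname "*${filter}*.mp4" -exec ls -t {} +
}
```

For example, `list_videos ComfyUI/output audio | head -n 1` would show the most recent video that has "audio" in its name.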
Pro Tips:
• For best results: allow ~4.5 mins per scene on Ultra Pro
• Want higher detail? Increase steps from 8 to 16
• Imperfect outputs? Cancel & re-run! AI isn't flawless, but it is fast
Creators & Credits
AJO6268 aka KurtCPhotoEd
Clark Glenn Davis aka Verevolf
SoundTech: manu_le_surikhate_gamer
INSTALLATION GUIDE: GETTING STARTED WITH THE DIRECTOR
Requirements
Latest ComfyUI (Portable or Custom Build)
AjoNodes
MMAudio
Wan2.1 Native model
A Google account to access Gemini API
Step-by-Step Installation
1. Install AjoNodes
AjoNodes contains all the custom logic that powers The Director's workflow.
GitHub: https://github.com/AJO-reading/ComfyUI-AjoNodes
To install:
    cd ComfyUI/custom_nodes
    git clone https://github.com/AJO-reading/ComfyUI-AjoNodes
Restart ComfyUI after installation.
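A plain git clone stops with an error if the target folder already exists and is not empty, which is common when updating from v1.0. The sketch below is one way to handle both cases; it is an illustration, not the authors' installer. The function name is hypothetical, and it only prints the git command it would suggest, so you can review it before running anything. It assumes paths without spaces.

```shell
# Print the git command needed to install or update a custom node.
# Nothing is executed; review the output, then run it yourself.
# Assumes paths without spaces.
plan_node_install() {
  local comfy_root="$1"   # path to your ComfyUI folder
  local repo_url="$2"
  local name
  name="$(basename "$repo_url" .git)"
  local dest="$comfy_root/custom_nodes/$name"
  if [ -d "$dest" ]; then
    echo "git -C $dest pull --ff-only"
  else
    echo "git clone $repo_url $dest"
  fi
}

plan_node_install ComfyUI https://github.com/AJO-reading/ComfyUI-AjoNodes
```

The same helper works for any other custom node installed by cloning a repository into custom_nodes.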
2. Download & Install Wan2.1 Native Model
This is the core model used for visual generation.
See: Wan2.1 ComfyUI Workflow - Complete Guide | ComfyUI Wiki
3. Install MMAudio (Sound Effects & Music)
Adds voice/music/sfx to your generated scenes.
GitHub: https://github.com/kijai/ComfyUI-MMAudio
Make sure to download the models and put them in ComfyUI/models/mmaudio
4. (Optional) Install Additional Models or LoRAs
Depending on the theme, you may want character-specific LoRAs or style models. Place those in:
models/loras/
models/embeddings/ (if using textual inversion)
5. Load the Director Workflow (.json)
• Open ComfyUI
• Load the provided TheDirectorV2.json workflow
• Paste your Gemini API Key in the designated node
• You're ready to go!
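If the workflow loads with missing (red) nodes, a quick folder check can narrow down which install step was skipped. This is a minimal sketch with a made-up function name. It assumes both custom-node repositories were cloned under their default folder names, and it does not check the Wan2.1 model files, because this guide does not list their file names.

```shell
# Report which of the expected folders are present under a ComfyUI install.
# Folder names assume the repositories were cloned with their default names.
# The exit status is the number of missing folders (0 means all found).
preflight() {
  local root="${1:-ComfyUI}"
  local missing=0
  local path
  for path in \
    custom_nodes/ComfyUI-AjoNodes \
    custom_nodes/ComfyUI-MMAudio \
    models/mmaudio
  do
    if [ -d "$root/$path" ]; then
      echo "ok       $path"
    else
      echo "missing  $path"
      missing=$(( missing + 1 ))
    fi
  done
  return "$missing"
}
```

Run it as `preflight /path/to/ComfyUI`; any line starting with "missing" points back to the matching installation step.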
That's it: you're ready to start generating movies like it's Hollywood, minus the budget.