LTX 2.3 Sulphur 2 text to video workflow in ComfyUI | Cinematic Animation

詳細

ファイルをダウンロード (1)

モデル説明

Turn text into cinematic character videos with synced motion fast.

Who it's for: creators who want this pipeline in ComfyUI without assembling nodes from scratch. Not for: one-click results with zero tuning - you still choose inputs, prompts, and settings.

Open preloaded workflow on RunComfy

Open preloaded workflow on RunComfy (browser)

Why RunComfy first
- Fewer missing-node surprises - run the graph in a managed environment before you mirror it locally.
- Quick GPU tryout - useful if your local VRAM or install time is the bottleneck.
- Matches the published JSON - the zip follows the same runnable workflow you can open on RunComfy.

When downloading for local ComfyUI makes sense - you want full control over models on disk, batch scripting, or offline runs.

How to use (local ComfyUI)
1. Load inputs (images/video/audio) in the marked loader nodes.
2. Set prompts, resolution, and seeds; start with a short test run.
3. Export from the Save / Write nodes shown in the graph.

Expectations - First run may pull large weights; cloud runs may require a free RunComfy account.


Overview

With the LTX 2.3 Sulphur 2 setup, you can transform text prompts into cinematic character animations with synchronized audio and motion. It integrates LTXV conditioning and Sulphur 2 modeling for smoother human movement and detailed visual rendering. Users can achieve high-quality results for short film concepts, animation tests, or storytelling prototypes. This workflow combines text, audio, and latent video decoding for seamless end-to-end creation. It suits creators needing rapid, controllable, and expressive video generation.

Important nodes:

Key nodes in Comfyui LTX 2.3 Sulphur 2 text to video workflow

LTXVConditioning (#304)
Merges positive and negative text conditioning and attaches the working frame rate so temporal guidance matches your render. Strong, specific scene language improves shot structure; concise negatives reduce artifacts. See the LTX‑2.3 model card for conditioning notes. Hugging Face: Lightricks/LTX-2.3

LTXVCropGuides (#284)
Softly steers composition to keep the main subject framed as intended. Use it to protect face size, horizon placement, or a centered subject before upscaling and refinement. It is especially helpful for dialogue‑style shots and medium closeups.

CFGGuider (#313, #282)
Controls how aggressively the prompt influences the diffusion trajectory in both passes. Use the first guider to lock in motion and staging, then the second to add crispness without drifting away from the established shot.

ManualSigmas (#306, #281)
Defines the noise schedule. Front‑loading more noise encourages larger motion exploration; a gentler schedule emphasizes temporal consistency. Keep the low‑res and high‑res schedules complementary rather than identical.

LTXVLatentUpsampler (#287)
Performs x2 latent upscaling using the official LTX upscaler so you gain detail before the refinement sampler. Swapping to another LTX‑2.3 upscaler variant can slightly change sharpness and grain. Hugging Face: Lightricks/LTX-2.3

VAEDecodeTiled (#314)
Decodes long or large clips in manageable tiles to avoid VRAM spikes. If you change spatial size or clip length, adjust tiling to balance memory headroom and decode speed.

LoraLoaderModelOnly (#285)
Applies the Sulphur 2 LoRA to the base model path so character fidelity and style cues transfer into both sampling stages. Use this to switch looks quickly while keeping the same LTX‑2.3 backbone. Hugging Face: SulphurAI/Sulphur-2-base

Notes

LTX 2.3 Sulphur 2 text to video workflow in ComfyUI | Cinematic Animation - see RunComfy page for the latest node requirements.

このモデルで生成された画像