LTX-2 First Last Frame: Controllable Video & Audio Generation

์„ธ๋ถ€ ์ •๋ณด

ํŒŒ์ผ ๋‹ค์šด๋กœ๋“œ (1)

๋ชจ๋ธ ์„ค๋ช…

๐Ÿš€ Create flawless, audio-synced cinematic video transitions from just a start and end frame.

โ–ถ๏ธ Run Directly in Cloud:
https://www.runcomfy.com/comfyui-workflows/ltx-2-first-last-frame-in-comfyui-audio-visual-motion-control?utm_source=civitai


๐Ÿ’ก Overview

LTX-2 First Last Frame is a powerful ComfyUI workflow tailored for creators who demand precise cinematic control. Define your starting frame and your ending frame, and the pipeline will seamlessly generate the motion between themโ€”complete with synchronized audio and visuals in a single pass.

By conditioning on both boundaries (with an optional guiding middle frame), the workflow perfectly preserves your subject's identity, framing, and lighting. Itโ€™s the ultimate tool for executing narrative beats, flawless scene transitions, and complex camera movements where temporal continuity and audio sync are an absolute must.

โœจ Key Features

  • Absolute Motion Control: Lock in your first and last frames; the workflow handles the smooth transition in between without identity loss.

  • 1-Pass Audio & Video: Utilizes the LTXV Audio VAE to generate perfectly synchronized sound effects, dialogue, or ambience alongside your visual action.

  • Dynamic Camera Trajectories: Fully compatible with camera LoRAs, allowing you to easily execute Dolly In/Out, Jib Up/Down, and Static shots.

  • Integrated 2X Upscale: Features a built-in spatial upscaling pass to cleanly resolve complex lighting, refine background elements, and deliver crisp, high-fidelity micro-details.

๐Ÿš€ Getting Started

  1. Model Setup: The core engine is LTX-2 19B (dev). (Note: For machines under 2x Large specs, please ensure the fp8 safetensors model is selected to avoid out-of-memory errors.)

  2. Prompting: Describe the scene action in your positive prompt. List any unwanted characteristics in the negative prompt.

  3. Configure Control: Upload your start and end images. Fine-tune the first_strength and last_strength nodes to dictate how strictly the workflow adheres to your frame references.

  4. Generate: Execute the prompt. The workflow will base sample an AV latent, run a targeted upscale, and automatically mux the decoded frames into a polished, ready-to-use MP4 video.


Click the "Run Directly" link above to bypass local setup and test this workflow immediately in your browser.

์ด ๋ชจ๋ธ๋กœ ๋งŒ๋“  ์ด๋ฏธ์ง€