LTX-2 Image Audio to Video

PixelMuseAI

Versions: LTX2.3, v3.0, v2.0, v1.0

Model description

This workflow takes an image and an audio track as input and generates a video.

Important Notice

Update ComfyUI and KJ Nodes to their latest versions before running this workflow. Much of their code has changed in the last few days.

Include --reserve-vram 1 in your launch options to avoid out-of-memory (OOM) errors.

If you get no lip sync, make sure your audio track is in stereo format (fix suggested by @thomasdimitri563).
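
If your audio track is mono, it needs to be converted to stereo first. The sketch below shows one possible way to do that with Python's standard wave module by writing every sample to both the left and right channel. It is only an illustration and not part of the workflow: the function name ensure_stereo is hypothetical, and it assumes an uncompressed PCM WAV file. Other formats (MP3, FLAC, and so on) would need a different tool.

```python
import wave


def ensure_stereo(src_path, dst_path):
    """Copy a PCM WAV file to dst_path, duplicating the channel if it is mono.

    Hypothetical helper for illustration. Returns the channel count of the
    file that was written.
    """
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        frames = src.readframes(params.nframes)

    nchannels = params.nchannels
    if nchannels == 1:
        width = params.sampwidth  # bytes per sample
        stereo = bytearray()
        # Interleave: write each mono sample twice (left, then right).
        for i in range(0, len(frames), width):
            sample = frames[i:i + width]
            stereo += sample + sample
        frames = bytes(stereo)
        nchannels = 2

    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(nchannels)
        dst.setsampwidth(params.sampwidth)
        dst.setframerate(params.framerate)
        dst.writeframes(frames)
    return nchannels
```

Files that already have two or more channels are copied unchanged, so the helper can be run on any WAV input before loading it into the workflow.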

Models to download (LTX2.3)

Place in models/diffusion_models

https://huggingface.co/Kijai/LTX2.3_comfy/blob/main/diffusion_models/ltx-2.3-22b-dev_transformer_only_fp8_scaled.safetensors

Place in models/loras

https://huggingface.co/Lightricks/LTX-2.3/blob/main/ltx-2.3-22b-distilled-lora-384.safetensors

Place in models/text_encoders

https://huggingface.co/Comfy-Org/ltx-2/resolve/main/split_files/text_encoders/gemma_3_12B_it_fp4_mixed.safetensors

https://huggingface.co/Kijai/LTX2.3_comfy/blob/main/text_encoders/ltx-2.3_text_projection_bf16.safetensors

Place in models/vae

https://huggingface.co/Kijai/LTX2.3_comfy/blob/main/vae/LTX23_audio_vae_bf16.safetensors

https://huggingface.co/Kijai/LTX2.3_comfy/blob/main/vae/LTX23_video_vae_bf16.safetensors
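
Most of the LTX2.3 links above are Hugging Face /blob/ page links, which open a web page rather than the file itself. Replacing /blob/ with /resolve/ gives the direct download link. The sketch below maps each file to its target folder and prints the direct URL for it. The names LTX23_FILES, to_direct_url and download_plan are hypothetical helpers for illustration, and the destination paths assume the folders are relative to your ComfyUI root.

```python
import posixpath
from urllib.parse import urlsplit

# Destination folder (assumed relative to the ComfyUI root) -> links from the list above.
LTX23_FILES = {
    "models/diffusion_models": [
        "https://huggingface.co/Kijai/LTX2.3_comfy/blob/main/diffusion_models/ltx-2.3-22b-dev_transformer_only_fp8_scaled.safetensors",
    ],
    "models/loras": [
        "https://huggingface.co/Lightricks/LTX-2.3/blob/main/ltx-2.3-22b-distilled-lora-384.safetensors",
    ],
    "models/text_encoders": [
        "https://huggingface.co/Comfy-Org/ltx-2/resolve/main/split_files/text_encoders/gemma_3_12B_it_fp4_mixed.safetensors",
        "https://huggingface.co/Kijai/LTX2.3_comfy/blob/main/text_encoders/ltx-2.3_text_projection_bf16.safetensors",
    ],
    "models/vae": [
        "https://huggingface.co/Kijai/LTX2.3_comfy/blob/main/vae/LTX23_audio_vae_bf16.safetensors",
        "https://huggingface.co/Kijai/LTX2.3_comfy/blob/main/vae/LTX23_video_vae_bf16.safetensors",
    ],
}


def to_direct_url(url):
    """Rewrite a Hugging Face '/blob/' page link into a '/resolve/' download link."""
    return url.replace("/blob/", "/resolve/", 1)


def download_plan(files):
    """Return a list of (direct_url, destination_path) pairs."""
    plan = []
    for folder, urls in files.items():
        for url in urls:
            filename = posixpath.basename(urlsplit(url).path)
            plan.append((to_direct_url(url), f"{folder}/{filename}"))
    return plan


if __name__ == "__main__":
    for url, dest in download_plan(LTX23_FILES):
        print(dest, "<-", url)
```

The printed URLs can be passed to any downloader you prefer. The sketch only builds the list and does not fetch anything itself.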

Models to download (V3)

Place in models/diffusion_models

https://huggingface.co/Lightricks/LTX-2/resolve/main/ltx-2-19b-distilled-fp8.safetensors

Place in models/text_encoders

https://huggingface.co/Comfy-Org/ltx-2/resolve/main/split_files/text_encoders/gemma_3_12B_it_fp4_mixed.safetensors

Place in models/loras

https://huggingface.co/Lightricks/LTX-2-19b-IC-LoRA-Detailer/resolve/main/ltx-2-19b-ic-lora-detailer.safetensors

https://huggingface.co/Lightricks/LTX-2-19b-LoRA-Camera-Control-Static/resolve/main/ltx-2-19b-lora-camera-control-static.safetensors
