LTX-2 19B: Next-Gen AI Video & Audio Generation Model
Model description
Upload in progress...
COMING SOON:
FP8 DISTILLED VERSION
LORA DISTILLED VERSION
SPATIAL UPSCALER
TEMPORAL UPSCALER
CAMERA CONTROL LORAS
CONTROLNET AIO LTX2
WORKFLOWS: I2V / V2V / T2V / VDETAILER
⚡ LTX-2 FP8 Distilled (Fast & Lightweight)
What is LTX-2 FP8 Distilled?
The FP8 Distilled version is a compressed and accelerated variant of LTX-2, trained to replicate the behavior of the full model while being faster and lighter.
Distillation reduces model complexity, making it more efficient, at the cost of some fine-grained detail.
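To make this concrete, here is a minimal PyTorch sketch of output-matching distillation in general: a smaller student network learns to reproduce a frozen teacher's outputs. The toy modules and loss are illustrative assumptions, not the actual LTX-2 distillation recipe.

```python
# Illustrative only: a tiny "student" trained to match a frozen "teacher".
# This is the generic idea of distillation, not LTX-2's training procedure.
import torch
import torch.nn as nn

teacher = nn.Sequential(nn.Linear(64, 256), nn.GELU(), nn.Linear(256, 64)).eval()
student = nn.Sequential(nn.Linear(64, 64), nn.GELU(), nn.Linear(64, 64))
optimizer = torch.optim.AdamW(student.parameters(), lr=1e-4)

for step in range(100):
    x = torch.randn(8, 64)                    # stand-in for latent inputs
    with torch.no_grad():
        target = teacher(x)                   # teacher output, gradients off
    loss = nn.functional.mse_loss(student(x), target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```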
✅ Key Characteristics
Faster generation speed
Lower VRAM requirements
Quicker prompt response
Slightly reduced fine detail compared to full FP8
Excellent quality-to-performance ratio
🎯 Best Use Cases
Rapid iteration & testing
Prompt exploration
Draft videos and previews
Creators with limited hardware
Recommended if:
You want speed and accessibility, and are willing to trade a small amount of detail for faster results.
🔹 LTX-2 FP8 Standard (Full Quality)
What is LTX-2 FP8 (Standard)?
The FP8 Standard version is a full-quality LTX-2 model quantized to FP8 precision.
It preserves the complete architecture and capabilities of the original model while reducing memory usage.
This is NOT a simplified model.
Only the numerical precision is reduced; the model's intelligence, structure, and behavior remain intact.
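As a rough illustration of what FP8 storage means, the sketch below keeps the same weight values while cutting the bytes per value in half versus FP16. It assumes a recent PyTorch build with float8 dtypes and does not show the scaling details of the actual LTX-2 FP8 checkpoint.

```python
# Illustrative only: same weights, lower-precision storage.
import torch

w_fp16 = torch.randn(4096, 4096, dtype=torch.float16)   # stand-in weight matrix
w_fp8 = w_fp16.to(torch.float8_e4m3fn)                   # 1 byte per value

print(w_fp16.nelement() * w_fp16.element_size() / 2**20, "MiB in FP16")  # ~32 MiB
print(w_fp8.nelement() * w_fp8.element_size() / 2**20, "MiB in FP8")     # ~16 MiB

# At compute time FP8 weights are upcast or used with FP8 matmul kernels;
# the values themselves are only slightly perturbed by quantization:
print((w_fp8.to(torch.float16) - w_fp16).abs().max())
```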
✅ Key Characteristics
High visual fidelity and detail
Strong temporal consistency
Full audio-video synchronization
Lower VRAM usage than FP16
Stable and reliable for long generations
🎯 Best Use Cases
Cinematic video generation
Final renders and high-quality outputs
Creators who want maximum quality with lower hardware requirements
Recommended if:
You want the best possible quality in FP8, with no compromise on features or flexibility.
Which One Should You Choose?
🎬 Go with FP8 Standard if quality and consistency matter most
⚡ Go with FP8 Distilled if speed and efficiency are your priority
Both versions are fully compatible with ComfyUI workflows and part of the same LTX-2 creative ecosystem.
What is LTX-2?
LTX-2 is a powerful multimodal AI model that transforms text prompts, images, or other media into fully synchronized audiovisual videos, with motion, dialogue, music, and ambient sound generated in one unified pass. It's built on a hybrid Diffusion-Transformer (DiT) architecture designed specifically for efficient spatiotemporal generation and audio-video alignment.
This approach lets creators go from idea to cinematic result without stitching separate audio tracks together manually, a major step beyond typical text-to-video systems.
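To give a feel for the "one unified pass" idea, here is a heavily simplified toy sketch of joint audio-video token processing in PyTorch. The shapes, module, and token counts are assumptions for illustration and do not reflect LTX-2's real architecture.

```python
# Toy illustration: video and audio latent tokens share one transformer pass,
# which is what keeps the two streams temporally aligned. Not LTX-2's design.
import torch
import torch.nn as nn

d_model = 256
block = nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True)

video_tokens = torch.randn(1, 512, d_model)   # e.g. patchified spatiotemporal latents
audio_tokens = torch.randn(1, 128, d_model)   # e.g. audio latent frames

tokens = torch.cat([video_tokens, audio_tokens], dim=1)   # one joint sequence
tokens = block(tokens)                                     # shared attention pass

video_out, audio_out = tokens.split([512, 128], dim=1)     # split back per modality
print(video_out.shape, audio_out.shape)
```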
✨ Key Features & Capabilities
🎥 Cinematic Quality Output
- Native 4K resolution support with playback up to 50 FPS, delivering smooth, high-detail video clips ideal for cinematic, commercial, or creative use.
🎵 Unified Audio & Visual Generation
- Generates synchronized audio (including dialogue, ambience, and music) alongside the video in a single generation pass, removing the need for external audio sync tools.
Flexible Input & Output Modes
- Works with text prompts, image references, multi-keyframe conditioning, and more to animate concepts or stills into motion.
⚙️ Performance Modes
- Multiple performance configurations (Fast, Pro, Ultra) allow creators to balance speed and quality according to project needs, from quick drafts to production-ready renders.
Efficient & Accessible
- Highly optimized for consumer-grade GPUs: efficient enough to run on roughly 16 GB of VRAM with the FP8/FP4 quantization options, making AI video production more accessible (see the rough memory arithmetic after this feature list).
🛠️ Open & Extensible
- Fully open weights, codebase, and workflows, enabling fine-tuning, custom LoRAs, and integration into tools like ComfyUI.
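The rough memory arithmetic referenced above (weights only, assuming the ~19B parameter count from the title) shows why the FP8/FP4 options matter on consumer GPUs. Activations, the VAE, the text encoder, and any offloading are extra and not modeled here.

```python
# Back-of-the-envelope weight memory for a ~19B-parameter model.
params = 19e9
bytes_per_value = {"bf16": 2, "fp8": 1, "fp4": 0.5}

for name, b in bytes_per_value.items():
    print(f"{name}: ~{params * b / 2**30:.1f} GiB of weights")
# bf16: ~35.4 GiB, fp8: ~17.7 GiB, fp4: ~8.8 GiB
```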
Improvements Over Earlier Versions
Compared to the original LTX family and other open video models, LTX-2 raises the bar in several key areas:
✅ Audio Integration Built-In
Instead of generating silent videos and requiring post-processing, LTX-2 outputs audio and visual streams together with temporal coherence.
✅ Higher Resolution & Frame Rates
Supports native 4K at up to 50 frames per second, reaching cinema-grade quality, unlike many earlier community models that cap out at lower resolutions or frame rates.
✅ Longer Clips
Offers extended-duration generation (clips up to roughly 20 seconds) with continuous quality and audio coherence, exceeding many alternatives; see the quick arithmetic after this list.
✅ Expanded Workflows
Native support in ComfyUI plus custom workflows empowers users with text-to-video, image-to-video, multi-keyframe conditioning, and creative control nodes.
Typical Use Cases
🔹 Cinematic storyboarding & concept visuals
🔹 Social media & marketing video content
🔹 Animated storytelling & motion design
🔹 Game cutscenes & immersive narratives
🔹 Product visualizations & dynamic ads
Whether for rapid prototyping or production output, LTX-2 empowers creators with professional-grade generative video.
🧩 Included Files & Variants
Depending on the checkpoint uploaded, this collection may include:
Full Model Checkpoints (bf16 / fp8 / fp4): maximum quality with quantization options
Distilled Variants: faster iteration with lighter compute cost
Spatial & Temporal Upscalers: improve resolution or frame rate via multiscale pipelines
LoRA & Fine-Tuning Packs: custom stylistic or control extension modules (see the sketch below)
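For anyone new to LoRAs, the sketch below shows the general mechanism: a low-rank update added to a frozen base weight. Names and shapes are illustrative; in practice the LTX-2 LoRA packs are applied by your loader or ComfyUI nodes.

```python
# Generic LoRA merge: W_merged = W + scale * (B @ A). Illustrative only.
import torch

d_out, d_in, rank, scale = 1024, 1024, 16, 1.0

W = torch.randn(d_out, d_in)           # frozen base weight
A = torch.randn(rank, d_in) * 0.01     # LoRA "down" projection
B = torch.zeros(d_out, rank)           # LoRA "up" projection (zero-init => no-op)

W_merged = W + scale * (B @ A)         # weight actually used at inference
print(W_merged.shape)
```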
ComfyUI Integration & Workflows
Included workflow templates help you use LTX-2 in ComfyUI with nodes for:
Text-to-Video: generate animated clips from prompts
Image-to-Video: animate still images with camera motion and style
Video Conditioning: extend clips forward/backward or refine motion
Keyframe Controls: precise guidance over scene transitions
These workflows are designed for ease of use and creative flexibility while demonstrating best practices for prompt structure and smooth temporal motion.
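If you prefer scripting over the graph UI, a workflow exported from ComfyUI with "Save (API Format)" can be queued against the local server's /prompt endpoint. The file name, node id, and input name below are placeholders for your own graph, and the address is ComfyUI's default.

```python
# Queue an exported LTX-2 workflow on a locally running ComfyUI instance.
import json
import urllib.request

with open("workflow_api.json") as f:       # exported via "Save (API Format)"
    workflow = json.load(f)

# Optionally edit an input before queueing; "6" and "text" are placeholders
# for whatever node/input your own graph uses for the prompt.
# workflow["6"]["inputs"]["text"] = "a slow cinematic dolly shot at dusk"

payload = json.dumps({"prompt": workflow}).encode("utf-8")
req = urllib.request.Request(
    "http://127.0.0.1:8188/prompt",
    data=payload,
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(resp.read().decode())            # response includes the queued prompt id
```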
Foundation Model Philosophy
LTX-2 goes beyond a single task: it's a foundation model for audiovisual creative AI. Open access to its weights, code, and tools encourages developers, artists, researchers, and hobbyists alike to customize, extend, and innovate on a common platform.
Summary
LTX-2 is not just another video model: it is a production-ready, synchronized audio-video foundation model that pushes the boundaries of what open-source video generation can achieve. With cinematic output quality, flexible workflows, and a fully open ecosystem, LTX-2 stands as one of the most capable generative video tools available today.
