LTX-2 Caption tool - LD
Details
Download Files (1)
Model description
Recently updated 26/03/2026 - full female body pre-set and Instagram video type pre-set
these focus on everything from clothing, to movement, ethnicity, breast size, actions. ect. (Instagram closer to clothed then naked)
The idea is simple - Caption a video so well then you can Give that same caption to LTX2.3 to recreate the video - Surely that makes the best lora?
If you find this tool really useful only IF IT WORKS PERFECTLY FOR YOU Consider - Buying-me-a-coffee <3
it goes a long way into fuelling my efforts
Install - Empty zip into a folder, - install bat, start bat, Model will download first run (load model)
- small update added images to it not just video





By Default all caption tools only scan 1 frame of a video first/middle or last frame
This can scan up to 10 frames (a lot of v-ram and slower) equally spaced apart AFTER the video is segmented into the desired length, not before so its accurate to that exact clip.
Caption from a video, (2 separate frames of the 5 second video)


