LTX-2 Caption tool - LD


Model description

  • Recently updated 26/03/2026 - added a full female body pre-set and an Instagram video type pre-set

    These focus on everything from clothing to movement, ethnicity, breast size, actions, etc. (the Instagram pre-set leans closer to clothed than naked)


    The idea is simple - caption a video so well that you can give that same caption to LTX2.3 to recreate the video. Surely that makes the best LoRA?

    If you find this tool really useful, and only IF IT WORKS PERFECTLY FOR YOU, consider Buying-me-a-coffee <3
    It goes a long way towards fuelling my efforts

    Install - Extract the zip into a folder, run the install bat, then the start bat. The model will download on first run (load model)
    - Small update: added image support, not just video






By default, caption tools only scan 1 frame of a video: the first, middle, or last frame.

This tool can scan up to 10 frames (uses a lot of VRAM and is slower), equally spaced apart AFTER the video is segmented into the desired length, not before, so it's accurate to that exact clip.
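A rough sketch of the "segment first, then sample" idea, for anyone curious. This is not the tool's actual code: the function names `segment_ranges` and `sample_indices` are made up for illustration, and picking the midpoint of each equal slice is just one assumed way to space frames evenly.

```python
def segment_ranges(total_frames, fps, clip_seconds):
    """Split a video into consecutive clips as (start, end) frame ranges, end exclusive.

    Hypothetical helper: the last clip may be shorter than clip_seconds.
    """
    clip_len = max(1, round(fps * clip_seconds))
    return [(start, min(start + clip_len, total_frames))
            for start in range(0, total_frames, clip_len)]


def sample_indices(start, end, n_frames, max_frames=10):
    """Pick up to max_frames evenly spaced frame indices inside one clip.

    Assumption: the clip is cut into n equal slices and the middle frame
    of each slice is used, so samples never fall outside the clip.
    """
    length = end - start
    if length <= 0:
        return []
    n = max(1, min(n_frames, max_frames, length))
    # Integer arithmetic for the slice midpoints avoids float rounding.
    return [start + ((2 * i + 1) * length) // (2 * n) for i in range(n)]


if __name__ == "__main__":
    # A 10 second video at 24 fps, cut into 5 second clips, 2 frames per clip.
    for clip_start, clip_end in segment_ranges(240, 24, 5):
        print((clip_start, clip_end), sample_indices(clip_start, clip_end, 2))
    # (0, 120) [30, 90]
    # (120, 240) [150, 210]
```

Because the indices are computed per clip, every sampled frame belongs to the clip being captioned. Sampling across the whole video before cutting could describe frames that never appear in that clip.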

Caption from a video (2 separate frames of the 5-second video)


