GLOW-UP aka lazy-WAN-KR (Florence Caption -> WAN2.2 T2I -> WAN2.2 I2V)

Details

Model description

Front end to WAN-KR workflow to caption an image via Florence and use WAN2.2 as I2V to create it anew and then feed it to WAN 2.2 for I2V.

V0.6 only: (This is basically multi-WAN-KR with a front end. For usage of multi-WAN-KR see here: multi-WAN-KR (WAN 2.2 I2V clip combine workflow) - v1.0rc3 | Wan Video Workflows | Civitai ) As of 0.8 it is a single clip Workflow.

You additionally need the low noise checkpoint for WAN 2.2 T2V although it should also work with WAN2.1 T2V

How to use: drop your old dusty SD1.5/SDXL image, select a target resolution for WAN T2V - as high as your machine permits and generate. In case you want to check the result first before I2V block the video gen with the black group bypasser.

If T2I looks fine, unblock GFXCARD_GOES_BRRR and let it do it's thing.
You can add your own prompt directives to the Florence caption, e.g. describe a camera movement, or an action happening.

TO DO: Add possibility to modify/enhance the Florence caption via LLM

Images made by this model

No Images Found.