Cartoonishly Eaten - Wan2.1 14b T2V (use VACE for I2V/FLF)
Details
Download Files
Model description
Cartoonishly Eaten
This LoRA introduces the concept of subjects and things eaten in cartoon-like way by being suddenly tossed into a giant mouth and then chewed and consumed non-graphically.
To choose the eater and/or the thing being eaten, use VACE. The examples illustrate each mode, in order: 2 x first-frame2video, first-last2video, last-frame2video, pure text2video (not recommended, as it's slightly retrained in this mode).
The generation resolution is advised to be 512x512 or close in the spatial dimensions, with recommended duration 45 base frames (49 total, 3 seconds).
The t2v training was made using diffusion-pipe for 100 epochs and flow shift of 4.5.
For image2video/flf2video recommended to use with the kijai VACE workflows, standard 1.0 lora weight, 4.0-6.0 cfg, 8.0-16.0 shift + cfg_zero_star. (see videos meta in comfy)
Best works on cartoon-stylized and anime characters, can be weird on realistic. For realistic, supplying an additional existing cartoon-style reference is advised.
Known issues: the object sometimes is bit/chewed like a gum, but not swallowed. (can be slightly countered with adding pushing it inside with the hand.)
The trigger word is 'eat style'. The best prompts are:
"""
eat style. The video begins with [object]. Then a gigantic cartoon hand seizes the [object] from below and tosses it into a gigantic mouth, which appeared on the right side. The camera zooms out, showing the new [eater] chewing and fully swallowing the old tiny [object].
"""
This LoRA is not for a beginner, so good luck!
P.S. In case you can't find the metadata, the example workflow for the wolf is here https://gist.github.com/kabachuha/a4b5ed1b46b6d4fb5f9e91d8aae1e482
