Yet Another Workflow: easy t2v + i2v (Wan 2.2)
Details
Download Files
About this version
Model description
Yet Another Workflow : easy t2v + i2v
I've aimed at a user-friendly UI for ComfyUI. There's a balance between complexity and ease of use, and this workflow aims to give you useful controls with clear guidance on what you need to care about. I hope these will be helpful to anyone strugging with quality and the general UI-isms of ComfyUI. I've taken the time to color code and add lots of notes. Please read the notes, I've tried to make them useful!
This is the workflow I use, it's not aimed at a skill level. It's designed to be easy to use and adjust with some UI concessions and labeling to ensure you can pilot it with less experience in a way that is more sophisticated than the official example workflows, which can be easy to break.
The primary goal with this workflow is to give you a strong foundational place to generate either text to video (T2V) or image to video (I2v) outputs without having to fuss too much. Lightx2\ning is on by default. (It's an accelerator that trades variety for generation speed.)
The green controls are the stuff you generall want to mess with.
The secondary goal here is to provide a consistent interface to interact with different samplers.
Versions
The main workflows supports the basic ksampler node, but also includes a toggle to enable the Clownshark sampler once you have some experience and want to mess around.
With version v0.3, in addition to additional clean-up and other minor improvements, I've introduced the TripleKSampler, which adds a new node that enables a more advanced 3 sampler technique which attempts to create an initial noise pattern without any accelertation LoRA's to create videos that maintain more of vanilla Wan's dynamics (at the expense of more time - about 15-30% more time).
I'll be updating the alternate versions on an ongoing basis.
If you are extremely new to Comfy and Wan? Consider using the MoE version. It removes a few nodes while providing mostly the same interface with slightly less visual complexity to help you get acclimated. Once you get comfy with this, step up up to the main version for more options.
Want better edge case prompt adherance? I've created a version of the workflow that supports the WanVideo nodes. I don't recommend using this one until you're more comfy with the standard version. It has increased visual complexity. These nodes work completely different to other systems, and I hope to make it more accessible by providing you with the same interface to engage with it. WanVideo tends to produce completely different results, so it can be another intersting thing to explore.
Like it?
Tag it as a Resource when you use it! Give it a thumb! I don't really need buzz, but if you want to tip me by messaging me a RunPod credit code, I'd be appreciative!
Need help?
I like helping people get going with this stuff, so if you want help message me. I'll walk you through the details, answer your questions, and give you some extra tips and
tricks, and scripts. I've done this for a few folks, I'll save you money and headaches. (I will ask that you send me a RunPod credit code if you take me up on this.)
I've also written an article here, which includes a shell script to speed up your setup time. The article has been updated with a newer, faster setup script alongside the release of v0.3.
General Advice
Make lots of videos! Post your videos! Don't fuss with the tech! Be smart about how you spend your time with this stuff. It's easy to burn out if you spend more time trying to get things to work than making videos you like. That's really why I'm posting these.
Use RunPod. Use the L40S. Use Hearmeman's Wan 2.2 template. If you've not used RunPod before, sign up with my link; we'll both get some free credit.
You'll need to install some custom nodes. To do that, click the "Manager" button at the top of the Comfy interface, and then click the "Install Missing Custom Nodes". Click "Install" on each one - I recommend in order; you'll need to wait till each has installed. Do not bother restarting ComfyUI until they are all installed.
If the wires bother you there's a button in the bottom right on the floating UI that will hide them.
What is Lightx2\ning? That's just my short hand for refering to Lightx2 and Lightning (which is just the Wan 2.2 version).
I've made it easy to turn off Lightx2\ning as well, if you want to try without, but note that it's much slower!
This workflow is setup for .safetensors models, but you can use GGUF if you want to make the changes.
If the having the Clownshark sampler in the UI is distracting, you can delete the group with no negative consequence. (You could also delete the purple mute node for the sampler selection as well.)
Costs?
In case you are curious, the example videos take around 6.5 minutes (720x1280). (I don't normally do that resolution when I'm just making stuff and experimenting.) I can generally make nice looking videos in 2-3 minutes. I'm generally running at about $0.90 per hour; in generally I think between 9 and 20 high quality videos per hour is what I tend to see, so about $0.045 - $0.10 per video, (rounding up) with the L40S. (With a session startup cost for loading the pod, probably adding a cent to so to that.) 1 to 2 minutes is probably my gen sweet spot for time, so it's a bit over my ideal, but that's a cost consideration. A faster GPU would get me there at a high cost, but it's the right balance for me with the current tech.
