MrXin Wan 2.2 I2V, MMA, Upscaler, 50FPS Workflow
Details
Download Files (1)
About this version
Model description
This workflow is designed for ComfyUI and leverages the Wan 2.2 Enhanced NSFW I2V model (in GGUF and Safetensors formats) to generate high-quality, dynamic image-to-video (I2V) animations, with a strong focus on NSFW content. It supports advanced features like model switching (high/low quality), audio generation via MMAudioSampler, video upscaling, color matching, and final video compilation at up to 50 FPS. The workflow includes built-in LoRA triggers for specific NSFW scenarios (e.g., cowgirl, deepthroat, cunnilingus, full nelson), making it ideal for creating sensual, explicit animations with realistic motion, lighting, and details.
Key Features:
Image-to-Video Generation: Converts a single input image into a video sequence using the WanImageToVideo node. Supports frame lengths up to 81, batch sizes, and resolutions like 480x720 (configurable via nodes like WIDTH, HEIGHT, LENGTH).
Model Variants: Switch between high-fidelity (Q8H/FP8H) and lightweight (Q8L/FP8L) versions of the Wan 2.2 model for performance optimization. Includes SD3 sampling shifts for better motion coherence.
Prompting System: Dual CLIP text encoders for positive/negative prompts. Built-in notes provide example triggers and prompts for NSFW acts (see original for examples).
Audio Integration: Generates ambient audio (e.g., moans, music) using MMAudioSampler with customizable duration, steps, CFG, and prompts. Negative audio prompts avoid low-quality noise or speech.
Post-Processing: VAE decoding for clean frames; image resizing and upscaling; color matching and restoration; video combining with VHS_VideoCombine (supports H264/H265 MP4, ping-pong looping, CRF quality control, and metadata saving). Preview options: Animation preview at 16 FPS and audio playback.
Optimization: VRAM cleanup nodes, CPU/GPU device switching, and batch processing for efficiency. Supports random seeds for variation.
Output: Saves videos/images in folders like "LongVid/%date:yyyy-MM-dd%/%date:hhmmss%" with prefixes (e.g., V for video, I for image, A for audio). Final videos can be upscaled to 50 FPS.
Requirements:
ComfyUI Version: Latest stable (tested on 2024–2026 builds).
Models (place in the appropriate ComfyUI folders: models/unet, models/vae, models/clip_vision, models/text_encoders, etc.):
Main Diffusion Models (Wan 2.2 Enhanced NSFW SVI Camera variants) — from nolightning's Lightning Edition pack:
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2Q8H.gguf →
https://civitai.com/api/download/models/2540892?type=Model&format=GGUF&size=full&fp=fp8
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2Q8L.gguf →
https://civitai.com/api/download/models/2540896?type=Model&format=GGUF&size=full&fp=fp8
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2FP8H.safetensors → https://civitai.com/api/download/models/2477539?type=Model&format=SafeTensor&size=full&fp=fp8
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2FP8L.safetensors → https://civitai.com/api/download/models/2477548?type=Model&format=SafeTensor&size=full&fp=fp8
VAE: Wan2.1_VAE.pth →
CLIP Vision: clip_vision_h.safetensors →
CLIP Text Encoder: umt5_xxl_fp8_e4m3fn_scaled.safetensors →
Audio: MMAudio model (via comfyui-mmaudio extension) — install the extension; models are usually auto-downloaded or available in the repo.
Upscale model: 4x_NMKD-Siax_200k →
https://civitai.com/api/download/models/2052724?type=Model&format=PickleTensor
Custom Nodes/Extensions (install via ComfyUI Manager):
comfyui-gguf (for GGUF model loading).
ComfyUI_Comfyroll_CustomNodes (math/utils).
comfyui-easy-use (cleanGpuUsed, mathFloat).
comfyui-kjnodes (INTConstant, ImageResizeKJv2, LoadVideosFromFolder, PreviewAnimation).
comfyui-videohelpersuite (VHS_VideoCombine).
comfyui-mmaudio (MMAudioSampler, audio preview).
comfyui-image-saver (Sampler/Scheduler selectors).
controlaltai-nodes (TwoWay/ThreeWaySwitch).
ComfyLiterals (Float node).
comfyui_memory_cleanup (VRAMCleanup).
Hardware: GPU with at least 12GB VRAM recommended for high-quality runs (e.g., 81-frame videos). CPU fallback available for some nodes.
How to Use:
Load the Workflow: Import the JSON into ComfyUI.
Input Image: Connect an image to the "IMAGE" node (e.g., via Load Image). Resize settings are in the "LOAD IMAGE & RESIZE" group.
Prompts: Edit the POSITIVE/NEGATIVE nodes with your description. Use the built-in trigger words for best NSFW results.
Settings: Adjust in "VIDEO SETTINGS" group:
Resolution: WIDTH/HEIGHT (default 480x720).
Frames: LENGTH (default 81), STEPS (default 8), CFG (default 1).
Seed: Randomize for variations.
Sampler/Scheduler: Euler Ancestral + Simple (defaults).
Batch Size: 1 (increase for multiples).
Run: Queue the prompt. Monitor VRAM with cleanup nodes.
Outputs: Videos save to ComfyUI/output/LongVid (customizable). Preview animation and audio in the workflow.
Advanced: Toggle high/low model switches for quality vs. speed. Add audio prompts in MMAudioSampler. Upscale in the "UPSCALE" group for smoother 50 FPS output.
Tips for Best Results:
NSFW Focus: Start with the example prompts in the notes for fluid motion (e.g., thrusting, jiggling). Avoid overlong prompts to prevent artifacts.
Audio Sync: Match audio duration to video length (default 10s). Use positive prompts like "moans, sensual sounds" and negatives to avoid distortion.
Performance: For low VRAM, use GGUF low models and disable audio/upscaling. Force offload in MMAudioSampler if needed.
Customization: Experiment with LoRAs (loaded in "LOAD LORA'S" group) for specific styles. Negative prompts handle artifacts like blur, distortion, or bad anatomy.
This workflow is optimized for explicit, high-detail NSFW I2V—perfect for creators exploring sensual animations.
