MrXin Wan 2.2 I2V, MMA, Upscaler, 50FPS Workflow
Model Description
This workflow is designed for ComfyUI and leverages the Wan 2.2 Enhanced NSFW I2V model (in GGUF and Safetensors formats) to generate high-quality, dynamic image-to-video (I2V) animations, with a strong focus on NSFW content. It supports advanced features like model switching (high/low quality), audio generation via MMAudioSampler, video upscaling, color matching, and final video compilation at up to 50 FPS. The workflow includes built-in LoRA triggers for specific NSFW scenarios (e.g., cowgirl, deepthroat, cunnilingus, full nelson), making it ideal for creating sensual, explicit animations with realistic motion, lighting, and details.
Key Features:
Image-to-Video Generation: Converts a single input image into a video sequence using the WanImageToVideo node. Supports clips of up to 81 frames, a configurable batch size, and resolutions such as 480x720 (set through the WIDTH, HEIGHT, and LENGTH nodes).
Model Variants: Switch between high-fidelity (Q8H/FP8H) and lightweight (Q8L/FP8L) versions of the Wan 2.2 model for performance optimization. Includes SD3 sampling shifts for better motion coherence.
Prompting System: Dual CLIP text encoders for positive/negative prompts. Built-in notes provide example triggers and prompts for NSFW acts (see original for examples).
Audio Integration: Generates ambient audio (e.g., moans, music) using MMAudioSampler with customizable duration, steps, CFG, and prompts. Negative audio prompts avoid low-quality noise or speech.
Post-Processing: VAE decoding for clean frames; image resizing and upscaling; color matching and restoration; video combining with VHS_VideoCombine (supports H264/H265 MP4, ping-pong looping, CRF quality control, and metadata saving). Preview options: Animation preview at 16 FPS and audio playback.
Optimization: VRAM cleanup nodes, CPU/GPU device switching, and batch processing for efficiency. Supports random seeds for variation.
Output: Saves videos/images in folders like "LongVid/%date:yyyy-MM-dd%/%date:hhmmss%" with prefixes (e.g., V for video, I for image, A for audio). Final videos can be exported at up to 50 FPS.
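To preview where a run will land on disk, the date placeholders in the output pattern can be expanded by hand. The sketch below is only an approximation of what ComfyUI does with `%date:...%` tokens: the helper name `expand_date_tokens` is hypothetical, it covers only the six tokens that appear in the pattern above, and it assumes `hh` is a 24-hour value.

```python
import re
from datetime import datetime

# Only the tokens used in the pattern above; mapping to strftime codes is an assumption.
_TOKENS = {"yyyy": "%Y", "MM": "%m", "dd": "%d", "hh": "%H", "mm": "%M", "ss": "%S"}


def expand_date_tokens(pattern: str, now: datetime) -> str:
    """Replace each %date:FORMAT% placeholder with the formatted timestamp."""

    def render(match: re.Match) -> str:
        fmt = match.group(1)
        # Swap every known token inside the placeholder for its formatted value.
        return re.sub(
            "yyyy|MM|dd|hh|mm|ss",
            lambda tok: now.strftime(_TOKENS[tok.group(0)]),
            fmt,
        )

    return re.sub(r"%date:([^%]+)%", render, pattern)


if __name__ == "__main__":
    stamp = datetime(2025, 3, 9, 14, 5, 7)
    print(expand_date_tokens("LongVid/%date:yyyy-MM-dd%/%date:hhmmss%", stamp))
    # prints: LongVid/2025-03-09/140507
```

Each run therefore gets its own dated folder and a time-stamped subfolder, which keeps the V/I/A files from one generation together.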
Requirements:
ComfyUI Version: Latest stable (tested on 2024–2026 builds).
Models (place in the appropriate ComfyUI folders: models/unet, models/vae, models/clip_vision, models/text_encoders, etc.):
Main Diffusion Models (Wan 2.2 Enhanced NSFW SVI Camera variants) — from nolightning's Lightning Edition pack:
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2Q8H.gguf → https://civitai.com/api/download/models/2540892?type=Model&format=GGUF&size=full&fp=fp8
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2Q8L.gguf → https://civitai.com/api/download/models/2540896?type=Model&format=GGUF&size=full&fp=fp8
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2FP8H.safetensors → https://civitai.com/api/download/models/2477539?type=Model&format=SafeTensor&size=full&fp=fp8
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2FP8L.safetensors → https://civitai.com/api/download/models/2477548?type=Model&format=SafeTensor&size=full&fp=fp8
VAE: Wan2.1_VAE.pth →
CLIP Vision: clip_vision_h.safetensors →
CLIP Text Encoder: umt5_xxl_fp8_e4m3fn_scaled.safetensors →
Audio: MMAudio model (via comfyui-mmaudio extension) — install the extension; models are usually auto-downloaded or available in the repo.
Upscale model: 4x_NMKD-Siax_200k → https://civitai.com/api/download/models/2052724?type=Model&format=PickleTensor
Custom Nodes/Extensions (install via ComfyUI Manager):
comfyui-gguf (for GGUF model loading).
ComfyUI_Comfyroll_CustomNodes (math/utils).
comfyui-easy-use (cleanGpuUsed, mathFloat).
comfyui-kjnodes (INTConstant, ImageResizeKJv2, LoadVideosFromFolder, PreviewAnimation).
comfyui-videohelpersuite (VHS_VideoCombine).
comfyui-mmaudio (MMAudioSampler, audio preview).
comfyui-image-saver (Sampler/Scheduler selectors).
controlaltai-nodes (TwoWay/ThreeWaySwitch).
ComfyLiterals (Float node).
comfyui_memory_cleanup (VRAMCleanup).
Hardware: GPU with at least 12GB VRAM recommended for high-quality runs (e.g., 81-frame videos). CPU fallback available for some nodes.
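Before importing the workflow, it can help to confirm that the model files are where the loaders will look for them. The following is a minimal sketch, not part of the workflow: the function name `find_missing_models` is hypothetical, and the file-to-folder mapping is an assumption, since the list above names the folders but does not say which file belongs in which. Adjust the mapping to your install; the FP8 Safetensors variants and the upscale model are left out because their target folders are not stated.

```python
from pathlib import Path

# Assumed placement of each file; edit to match your own setup.
EXPECTED_FILES = {
    "models/unet": [
        "wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2Q8H.gguf",
        "wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2Q8L.gguf",
    ],
    "models/vae": ["Wan2.1_VAE.pth"],
    "models/clip_vision": ["clip_vision_h.safetensors"],
    "models/text_encoders": ["umt5_xxl_fp8_e4m3fn_scaled.safetensors"],
}


def find_missing_models(comfy_root, expected=EXPECTED_FILES):
    """Return the relative paths of expected model files that are not on disk."""
    root = Path(comfy_root)
    return [
        f"{folder}/{name}"
        for folder, names in expected.items()
        for name in names
        if not (root / folder / name).is_file()
    ]


if __name__ == "__main__":
    # "ComfyUI" is a placeholder for the path to your ComfyUI directory.
    for path in find_missing_models("ComfyUI"):
        print("missing:", path)
```

An empty result only means the files exist at the assumed paths; it does not verify that the downloads are complete or uncorrupted.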
How to Use:
Load the Workflow: Import the JSON into ComfyUI.
Input Image: Connect an image to the "IMAGE" node (e.g., via Load Image). Resize settings are in the "LOAD IMAGE & RESIZE" group.
Prompts: Edit the POSITIVE/NEGATIVE nodes with your description. Use the built-in trigger words for best NSFW results.
Settings: Adjust in "VIDEO SETTINGS" group:
Resolution: WIDTH/HEIGHT (default 480x720).
Frames: LENGTH (default 81), STEPS (default 8), CFG (default 1).
Seed: Randomize for variations.
Sampler/Scheduler: Euler Ancestral + Simple (defaults).
Batch Size: 1 (increase for multiples).
Run: Queue the prompt. Monitor VRAM with cleanup nodes.
Outputs: Videos save to ComfyUI/output/LongVid (customizable). Preview animation and audio in the workflow.
Advanced: Toggle high/low model switches for quality vs. speed. Add audio prompts in MMAudioSampler. Upscale in the "UPSCALE" group for smoother 50 FPS output.
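The defaults from the "VIDEO SETTINGS" group can be written down as a small record, which is handy for sanity-checking values before queueing a long run. This is an illustrative sketch only: the class `VideoSettings` and its `problems` method are hypothetical and not part of the workflow. The 81-frame ceiling comes from the feature list above; the "length is 4n+1" and "width/height are multiples of 16" checks are assumptions based on the step sizes commonly exposed by the WanImageToVideo node, so treat them as warnings rather than hard rules.

```python
from dataclasses import dataclass


@dataclass
class VideoSettings:
    # Defaults as listed in the "VIDEO SETTINGS" group.
    width: int = 480
    height: int = 720
    length: int = 81            # number of frames
    steps: int = 8
    cfg: float = 1.0
    sampler: str = "euler_ancestral"
    scheduler: str = "simple"
    batch_size: int = 1

    def problems(self) -> list:
        """Return human-readable warnings; an empty list means nothing looks off."""
        issues = []
        if self.length > 81:
            issues.append("length exceeds the 81 frames the workflow is described as supporting")
        if self.length % 4 != 1:
            # Assumption: frame counts of the form 4n+1 (e.g. 49, 65, 81).
            issues.append("length is not of the form 4n+1")
        if self.width % 16 or self.height % 16:
            # Assumption: dimensions step in multiples of 16.
            issues.append("width/height are not multiples of 16")
        if self.batch_size < 1:
            issues.append("batch_size must be at least 1")
        return issues


if __name__ == "__main__":
    print(VideoSettings().problems())            # defaults pass every check
    print(VideoSettings(length=80).problems())   # flags the 4n+1 assumption
```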
Tips for Best Results:
NSFW Focus: Start with the example prompts in the notes for fluid motion (e.g., thrusting, jiggling). Avoid overlong prompts to prevent artifacts.
Audio Sync: Match the audio duration (default 10s) to the video length. Use positive prompts like "moans, sensual sounds" and negatives to avoid distortion.
Performance: For low VRAM, use GGUF low models and disable audio/upscaling. Force offload in MMAudioSampler if needed.
Customization: Experiment with LoRAs (loaded in "LOAD LORA'S" group) for specific styles. Negative prompts handle artifacts like blur, distortion, or bad anatomy.
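For the audio sync tip, the clip length in seconds is simply the frame count divided by the playback frame rate. The sketch below works that out and compares it with an audio duration; the function names and the 0.25-second tolerance are illustrative assumptions. Use the frame count and FPS of the final video you export, which may differ from the 16 FPS preview.

```python
def clip_seconds(frames: int, fps: float) -> float:
    """Duration of a clip in seconds for a given frame count and frame rate."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return frames / fps


def audio_matches(frames: int, fps: float, audio_seconds: float,
                  tolerance: float = 0.25) -> bool:
    """True when the audio duration is within `tolerance` seconds of the clip."""
    return abs(clip_seconds(frames, fps) - audio_seconds) <= tolerance


if __name__ == "__main__":
    print(clip_seconds(81, 16))          # 5.0625
    print(audio_matches(81, 16, 10.0))   # False: 10s audio overruns an 81-frame clip at 16 FPS
    print(audio_matches(81, 16, 5.0))    # True
```

With the default 81 frames played at 16 FPS, the clip runs just over five seconds, so a 10-second audio duration would need to be shortened (or the video lengthened) for the two to line up.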
This workflow is optimized for explicit, high-detail NSFW I2V and is aimed at creators exploring sensual animations.
