Wan2.2 I2V 12GB (20 Seconds, MMAUDIO, 60FPS, LOW VRAM)

Model Description

WAN 2.2 – I2V Workflow (Optimized for 12GB GPUs)

A fast, clean, and VRAM-efficient Image-to-Video workflow built around WAN 2.2, with fast render times on mid-range GPUs. I tried to keep it simple and easy to use while maintaining good results, relying on well-known nodes and minimizing node bloat. The workflow is commented throughout and has a clear flow.

Ver 1.0 - Base workflow; renders 5-second clips in one iteration (very fast for 12GB).

Ver 1.1 - More stability; can run 100 times consecutively in 8 hours.

Ver 1.2 - Renders 20-second videos. Wire cleanup.

Ver 1.3 - MMAudio added.

Ver 1.4 - 2x upscaling, color correction, and sharpening between passes for quality consistency.

Ver 1.5 - Fixed MMAudio. Updated controls, with an easy way to render 5-, 10-, 15-, and 20-second videos. Split RIFE between phases. Fixed prompts. Cleaned up the workflow.

  • WAN 2.2 Model
    Use either lightweight GGUF models or full .safetensors checkpoints.
    Lightning LoRAs are baked into the .safetensors model, so LoRA use is optional.

  • WAN 2.1 VAE
    Reduces VRAM load while maintaining strong color, detail structure, and temporal consistency.

  • SageAttention + FP16 Accumulation Patch
    Automatically applied for speed and throughput.

  • Dual KSampler passes (HIGH/LOW)
    Uses 6 steps and a 101-frame length for smooth animation and solid prompt adherence.

  • LoRA Loader + Model Shift Controls
    Supports stylistic LoRAs with synchronized shift values across both samplers.

  • RIFE Frame Interpolation (60 FPS)
    Creates ultra-fluid motion.

  • Upscale + Adaptive Sharpen Pass
    Between phases, the UltraSharp upscaler is applied to keep quality consistent.
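
The dual HIGH/LOW sampler passes in the list above can be pictured as one denoising schedule handed from one sampler to the next. The sketch below is illustrative only: it assumes the 6 steps are the total shared by both passes and that the hand-off happens at step 3, neither of which is stated here. The setting names mirror the inputs of ComfyUI's KSampler (Advanced) node, and `split_sampler_settings` is a hypothetical helper, not part of the workflow.

```python
def split_sampler_settings(total_steps: int, switch_step: int) -> tuple[dict, dict]:
    """Build settings for a high-noise pass followed by a low-noise pass.

    switch_step is the step where the first sampler stops and the second
    one takes over. Its value here is an assumption, not a documented one.
    """
    if not 0 < switch_step < total_steps:
        raise ValueError("switch_step must fall strictly inside the schedule")

    # First pass: adds the initial noise and stops early, handing over a
    # latent that still contains the remaining noise.
    high_pass = {
        "add_noise": "enable",
        "steps": total_steps,
        "start_at_step": 0,
        "end_at_step": switch_step,
        "return_with_leftover_noise": "enable",
    }
    # Second pass: adds no new noise and finishes the same schedule.
    low_pass = {
        "add_noise": "disable",
        "steps": total_steps,
        "start_at_step": switch_step,
        "end_at_step": total_steps,
        "return_with_leftover_noise": "disable",
    }
    return high_pass, low_pass


# 6 total steps comes from the feature list; the split at step 3 is assumed.
high, low = split_sampler_settings(total_steps=6, switch_step=3)
print(high["start_at_step"], high["end_at_step"])  # 0 3
print(low["start_at_step"], low["end_at_step"])    # 3 6
```

Both passes share the same `steps` value so they follow one noise schedule; only the start and end points differ.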

Performance

  • GPU: 12GB VRAM

  • Render Speed: ~6 minutes (v1.0)

  • Output FPS: ~60–64 FPS

  • Resolution: ~1072 × 1616 (post-upscale)
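
For a rough sense of how these figures relate, the arithmetic can be sketched in Python. The 2x upscale factor comes from the Ver 1.4 notes. The 16 FPS base rate and 4x RIFE multiplier are assumptions, chosen only because 16 × 4 = 64 falls inside the reported output range. The calculation also assumes the final resolution comes from a single 2x upscale.

```python
UPSCALE_FACTOR = 2                      # from the Ver 1.4 notes
FINAL_WIDTH, FINAL_HEIGHT = 1072, 1616  # from the Performance list (approximate)

BASE_FPS = 16        # assumption: frame rate of the sampled clip
RIFE_MULTIPLIER = 4  # assumption: interpolation factor


def pre_upscale_size(width: int, height: int, factor: int) -> tuple[int, int]:
    """Resolution the sampler would work at before an upscale of `factor`."""
    return width // factor, height // factor


def interpolated_fps(base_fps: int, multiplier: int) -> int:
    """Playback rate after frame interpolation multiplies the frame count."""
    return base_fps * multiplier


print(pre_upscale_size(FINAL_WIDTH, FINAL_HEIGHT, UPSCALE_FACTOR))  # (536, 808)
print(interpolated_fps(BASE_FPS, RIFE_MULTIPLIER))                  # 64
```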

Requirements
