Z-Image-Turbo/Base-AIO-Workflow

🚀 Z-Image AIO | Official Workflows

Turbo (8 Steps) & Base (28-50 Steps) • Photorealistic Generation • Bilingual Text • FP8 / FP16 / BF16

⚠️ Requires ComfyUI v0.11.0+


📦 Eight Official Workflows Available

All workflows work with the FP8 (~10GB), FP16 (~20GB), and BF16 (~20GB) versions!


🆕 Z-Image-Base-AIO Workflow (ZIB-AIO-Base)

Full foundation model with maximum creative control!

The undistilled 6B parameter model for professional work. Features full CFG control (3.0-5.0), negative prompt support, and high output diversity. Ideal base for LoRA training and complex prompt engineering. Includes SeedVR2 upscaler and Seed Variance Enhancer as optional features.

Key Features:

  • ✅ Full CFG control (3.0-5.0)

  • ✅ Negative prompts supported

  • ✅ High output diversity

  • ✅ Ideal for LoRA/ControlNet development

  • ✅ Optional LoRA loading via LoraManager

  • ✅ Optional SeedVR2 upscaling

  • ✅ Optional Seed Variance Enhancer

Required Custom Nodes: 5 nodes (see below)


🚀 Standard Workflow v1.0 & v2.0 (ZIT-AIO-v1.0 / ZIT-AIO-v2.0)

Simple text-to-image workflow with improved upscaler and dual sampler options. Features automatic metadata saving and denoise control for upscaling. Perfect for beginners and quick generations. Requires 2 custom nodes.

v2.0 improvements: Enhanced upscaler with denoise control, better sampler options (res_multistep or euler_ancestral), dual scheduler support (simple or beta).


🎮 ControlNet Workflow (ZIT-AIO-Control)

Guided generation with reference images using ControlNet Union (Canny, HED, Depth, Pose, MLSD). Uses megapixel scaling that maintains aspect ratio automatically. Perfect for sketch-to-photo, pose transfer, and precise composition control. Requires ComfyUI 3.77+ and ControlNet Union file.


🎲 Seed Variance Enhancer Workflow (ZIT-AIO-Variance)

Adds diversity to outputs by introducing controlled noise to text embeddings. Compensates for low seed variance - get more varied results with the same prompt. Includes manual seed control for reproducibility. Requires SeedVarianceEnhancer custom node.


🎬 SeedVR2 Video Upscaler Workflow (ZIT-AIO-SeedVR2)

Professional diffusion-based upscaling using DiT (Diffusion Transformer) models. Delivers superior quality with temporal consistency for videos and images. Supports multiple model variants (3B/7B with FP16/FP8/GGUF) and memory optimization options. Requires SeedVR2 custom node.


🌊 Depth Anything V3 ControlNet Workflow (ZIT-AIO-DepthV3)

State-of-the-art depth-guided generation with dual modes: create depth-controlled images OR preview depth as 3D point clouds. Superior multi-view depth consistency compared to traditional methods. Features toggle system for easy mode switching. Requires Depth Anything 3 custom nodes.


🖼️ Z-Image-Turbo-Anime Workflow (ZIT-AIO-Anime)

This workflow includes several small but meaningful adjustments and integrates multiple custom nodes. It features the Seed Variance Enhancer, which helps generate different image variations from the same prompt by increasing effective seed diversity. Additionally, SeedVR2 is included as an alternative upscaling solution.

Z-Image-Turbo-Anime: https://civitai.com/models/2259646/z-image-turbo-anime


📊 Quick Comparison

Turbo Workflows (8-9 Steps, CFG 1.0)

🚀 Standard v1/v2 → Text-to-image → Simple & fast → 2 custom nodes

🎮 ControlNet → Guided generation → 5 control types → 3 nodes + ControlNet file

🎲 Seed Enhancer → Output diversity → More variations → SeedVarianceEnhancer node

🎬 SeedVR2 → Professional upscaling → Diffusion-based → SeedVR2 node + models

🌊 DA3 DepthV3 → Depth-guided + 3D → Dual modes → Depth Anything 3 nodes

🖼️ Anime → Anime style → Custom merged → 5 custom nodes

Base Workflow (28-50 Steps, CFG 3.0-5.0)

🆕 Base-AIO → Full control → CFG + Negative prompts → 5 custom nodes


🔄 Model Versions Available

Z-Image-Turbo-AIO (8 Steps, CFG 1.0)

🟡 FP8-AIO (~10GB) - Recommended for most users

🔵 FP16-AIO (~20GB) - Wide GPU compatibility

🌟 BF16-AIO (~20GB) - Maximum quality

Z-Image-Base-AIO (28-50 Steps, CFG 3.0-5.0) 🆕

🟡 FP8-AIO (~10GB) - Fast, daily use

🔵 FP16-AIO (~20GB) - Wide GPU compatibility (RTX 2000/3000)

🌟 BF16-AIO (~20GB) - Max quality, ideal for LoRA training

All versions work on 8GB VRAM!


🆚 Turbo vs Base - When to Use Which?

Use Turbo when:

⚡ Speed is the priority - 8 steps = 3-5 seconds

📸 Production workflows - Consistent high quality

💾 Quick iterations - Rapid prototyping

🎯 Simple prompts - Less complex scenes

Use Base when:

🎨 Creative exploration - Higher diversity across seeds

🔧 LoRA/ControlNet development - Undistilled foundation

📝 Complex prompt engineering - Full CFG control

🚫 Negative prompting needed - Remove unwanted elements

🎯 Maximum control - Fine-tune every aspect


⚙️ Settings by Model Type

Z-Image-Turbo Settings (All Turbo Workflows)

📊 Steps: 8-9

🎚️ CFG: 1.0 (don't change!)

🚫 Negative Prompt: ❌ Not used

🎲 Sampler: res_multistep (sharp) / euler_ancestral (smooth)

📈 Scheduler: simple (clean) / beta (balanced)

Z-Image-Base Settings (Base-AIO Workflow) 🆕

📊 Steps: 28-50

🎚️ CFG: 3.0-5.0

🚫 Negative Prompt: ✅ Full support!

🎲 Sampler: euler ⭐ / dpmpp_2m

📈 Scheduler: normal ⭐ / karras
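
As a rough sketch, the two presets above can be written down as plain data and used to sanity-check a configuration before queueing a generation. The `PRESETS` layout and the `check_settings` helper are hypothetical names made up for this illustration; only the step ranges, CFG ranges, and sampler/scheduler names come from the settings listed above.

```python
# Hypothetical helper: the dict layout and function name are illustrative.
# Only the numeric ranges and sampler/scheduler names come from this page.
PRESETS = {
    "turbo": {
        "steps": (8, 9),
        "cfg": (1.0, 1.0),
        "samplers": ("res_multistep", "euler_ancestral"),
        "schedulers": ("simple", "beta"),
    },
    "base": {
        "steps": (28, 50),
        "cfg": (3.0, 5.0),
        "samplers": ("euler", "dpmpp_2m"),
        "schedulers": ("normal", "karras"),
    },
}


def check_settings(model, steps, cfg, sampler, scheduler):
    """Return a list of messages for values outside the listed preset."""
    preset = PRESETS[model]
    problems = []
    lo, hi = preset["steps"]
    if not lo <= steps <= hi:
        problems.append(f"steps={steps} outside {lo}-{hi}")
    lo, hi = preset["cfg"]
    if not lo <= cfg <= hi:
        problems.append(f"cfg={cfg} outside {lo}-{hi}")
    if sampler not in preset["samplers"]:
        problems.append(f"sampler {sampler!r} is not a listed option")
    if scheduler not in preset["schedulers"]:
        problems.append(f"scheduler {scheduler!r} is not a listed option")
    return problems


if __name__ == "__main__":
    # Turbo values applied to the Base model: both steps and CFG get flagged.
    for message in check_settings("base", 8, 1.0, "euler", "normal"):
        print(message)
```

An empty list means the values sit inside the ranges listed for that model; anything else is only a hint, since other samplers or step counts may still work.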


💡 Prompting Tips

Natural Language Works Best

Professional food photography of artisan breakfast plate. 
Golden poached eggs on sourdough toast, crispy bacon, fresh 
avocado slices. Morning sunlight creating warm glow. Shallow 
depth of field, magazine-quality presentation.

Bilingual Text Rendering

Neon sign reading "COFFEE SHOP" in bright blue letters
Sign with "咖啡店" in elegant gold calligraphy

Important

Turbo:

  • ❌ NO negative prompts (model ignores them)

  • ✅ Natural language, not tags

  • ✅ Detailed (100-300 words)

Base:

  • ✅ Negative prompts work great!

  • ✅ Natural language, not tags

  • ✅ Detailed (100-300 words)

  • ✅ Use CFG 3.0-5.0 for control
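
A quick way to check a prompt against the 100-300 word guideline is to count whitespace-separated words. This is a minimal sketch with hypothetical helper names; splitting on whitespace is an assumption that suits English prompts and will undercount text written without spaces, such as Chinese.

```python
def prompt_word_count(prompt: str) -> int:
    """Count whitespace-separated words in a prompt."""
    return len(prompt.split())


def in_recommended_range(prompt: str, low: int = 100, high: int = 300) -> bool:
    """True if the prompt length falls inside the suggested word range."""
    return low <= prompt_word_count(prompt) <= high


if __name__ == "__main__":
    draft = "Neon sign reading COFFEE SHOP in bright blue letters"
    print(prompt_word_count(draft), in_recommended_range(draft))
```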


📥 Downloads

Main Models

Z-Image-Turbo-AIO:

Z-Image-Base-AIO: 🆕

Additional Files


📦 Custom Nodes

Required for ALL Workflows

rgthree-comfy https://github.com/rgthree/rgthree-comfy

comfyui_image_metadata_extension https://github.com/edelvarden/comfyui_image_metadata_extension

Additional per Workflow

ZIB-AIO-Base (Base Workflow): 🆕

ZIT-AIO-Control:

ZIT-AIO-Variance:

ZIT-AIO-SeedVR2:

ZIT-AIO-DepthV3:

ZIT-AIO-Anime:

💡 Tip: Use ComfyUI Manager → "Install Missing Custom Nodes" for easy installation!


🎯 Workflow-Specific Details

🆕 Base-AIO (ZIB-AIO-Base)

  • Steps: 28-50 (more = better quality)

  • CFG: 3.0-5.0 (4.0 recommended start)

  • Sampler: euler (sharp) / dpmpp_2m (smooth)

  • Scheduler: normal (standard) / karras (refined)

  • Negative prompts: ✅ Full support!

  • Upscaler: Optional with denoise 0.35

  • LoRA: Optional via LoraManager node

  • SeedVR2: Optional for AI upscaling

  • Seed Variance: Optional for diversity

🚀 Standard v2.0

  • Improved upscaler with denoise control (0.4-0.6)

  • Dual sampler support

  • scale_by parameter for output size

  • Perfect for everyday use

🎮 ControlNet

  • 5 control types: Canny, HED, Depth, Pose, MLSD

  • Megapixel scaling (auto aspect ratio)

  • ControlNet strength: 0.6-0.8 recommended

  • ⚠️ Save ControlNet in: ComfyUI/models/model_patches/
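
To illustrate what megapixel scaling with automatic aspect ratio means, the sketch below resizes a reference image to a target pixel budget while keeping its proportions. It is not the workflow's actual node code: the function name, the definition of one megapixel as 1,000,000 pixels, and the rounding to a multiple of 8 are all assumptions made for this example.

```python
import math


def scale_to_megapixels(width, height, megapixels=1.0, multiple=8):
    """Scale (width, height) to roughly `megapixels` total pixels.

    The aspect ratio is preserved up to rounding. Both the 1 MP = 1,000,000
    pixels convention and the rounding to `multiple` are assumptions.
    """
    target_pixels = megapixels * 1_000_000
    scale = math.sqrt(target_pixels / (width * height))
    new_width = max(multiple, round(width * scale / multiple) * multiple)
    new_height = max(multiple, round(height * scale / multiple) * multiple)
    return new_width, new_height


if __name__ == "__main__":
    # A 4000x2000 reference scaled down to a 2-megapixel budget.
    print(scale_to_megapixels(4000, 2000, megapixels=2.0))
```

Because both sides are multiplied by the same factor, a 2:1 reference stays 2:1 after scaling, which is why no manual width/height entry is needed.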

🎲 Seed Variance Enhancer

  • randomize_percent: 50

  • strength: 20-30

  • noise_insert: 'noise on beginning steps'

  • Trade-off: Diversity vs prompt adherence
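
The idea of adding controlled noise to text embeddings can be sketched in a few lines. This is an illustration only and not the SeedVarianceEnhancer node's implementation: how the node scales `strength`, which values it selects, and how 'noise on beginning steps' is applied are not described here, so the uniform noise and per-value selection below are assumptions.

```python
import random


def perturb_embedding(embedding, randomize_percent=50, strength=20, seed=0):
    """Add uniform noise to a random subset of embedding values.

    Illustrative only: the noise distribution and the meaning of `strength`
    are assumptions, not the custom node's actual behaviour.
    """
    rng = random.Random(seed)  # fixed seed keeps the result reproducible
    noisy = list(embedding)
    for i in range(len(noisy)):
        if rng.random() * 100 < randomize_percent:
            noisy[i] += rng.uniform(-1.0, 1.0) * strength
    return noisy


if __name__ == "__main__":
    original = [0.0] * 8
    print(perturb_embedding(original, seed=1))
    print(perturb_embedding(original, seed=2))  # different seed, different noise
```

The trade-off in the last bullet shows up directly: a larger `strength` or `randomize_percent` moves the embedding further from what the prompt encoded, which buys variety at the cost of prompt adherence.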

🎬 SeedVR2

  • resolution: 1536 (target for short edge)

  • batch_size: MUST be 4n+1 (1, 5, 9, 13, 17, 21...)

  • color_correction: 'lab' (recommended)

  • Models: 3B (faster) or 7B (higher quality)
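
The 4n+1 rule for batch_size is easy to get wrong when picking a value by hand. The hypothetical helpers below check a value and round an arbitrary number down to the nearest valid one; the names are made up for this example and are not part of the SeedVR2 node.

```python
def is_valid_batch_size(n: int) -> bool:
    """True if n has the form 4k+1 (1, 5, 9, 13, ...)."""
    return n >= 1 and n % 4 == 1


def nearest_valid_batch_size(n: int) -> int:
    """Largest value of the form 4k+1 that does not exceed n (minimum 1)."""
    if n < 1:
        return 1
    return n - ((n - 1) % 4)


if __name__ == "__main__":
    print([b for b in range(1, 22) if is_valid_batch_size(b)])
    print(nearest_valid_batch_size(8))
```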

🌊 Depth Anything V3

  • Two modes: Generate images OR 3D preview

  • Models: da3_small/base/large/giant

  • 8GB VRAM: Use up to da3_large

  • ⚠️ Toggle correctly: Generate mode ≠ 3D Preview mode


❓ FAQ

Q: Which workflow should I use? A: Turbo Standard v2 for quick text-to-image. Base-AIO for full control & LoRA work. Others for specific needs.

Q: FP8, FP16, or BF16? A: FP8 for most users (10GB). FP16 for older GPUs. BF16 for maximum quality (20GB).

Q: Turbo or Base? A: Turbo for speed (8 steps). Base for control & quality (28-50 steps, CFG, negative prompts).

Q: Which sampler is better? A: Turbo: res_multistep (sharp) or euler_ancestral (smooth). Base: euler or dpmpp_2m.

Q: Metadata saved automatically? A: Yes! All workflows save metadata for easy CivitAI uploads.

Q: Works on 8GB VRAM? A: Yes! All workflows tested on RTX 4060 8GB.

Q: Do I need all custom nodes? A: No! Only install nodes for workflows you'll use. Base requirement is 2 nodes (rgthree + metadata).

Q: Can I use LoRAs with Base? A: Yes! Base-AIO includes LoraManager for easy LoRA loading. Turbo doesn't support LoRA training well.


🌟 Key Features

All Workflows

✨ Fast Generation - Turbo: 3-5 sec | Base: 30-60 sec

📦 All-in-One - VAE + Text Encoder integrated

📸 Photorealistic - Professional quality

📖 Bilingual - English & Chinese text rendering

💾 Metadata Auto-Save - Easy sharing

🎯 8GB VRAM Friendly - Accessible to everyone

Base-Specific 🆕

⚙️ Full CFG Control - 3.0-5.0 range

🚫 Negative Prompts - Remove unwanted elements

🎨 High Diversity - More variation across seeds

🔧 LoRA Ready - Ideal for training & using LoRAs

Turbo-Specific

⚡ Ultra-Fast - 8-9 steps only

🎯 Consistent - Same quality every time

💾 Efficient - Lower compute needed


🎨 Perfect For

Base-AIO:

  • LoRA training & testing

  • Complex compositions

  • Fine-tuned control

  • Professional projects

  • Creative exploration

Turbo Workflows:

  • Quick iterations

  • Production workflows

  • Social media content

  • Marketing materials

  • Rapid prototyping

Both:

  • Product photography

  • Architectural visualization

  • Food photography

  • Portrait photography

  • Bilingual content


System Requirements

Minimum:

  • VRAM: 8GB

  • RAM: 16GB

  • ComfyUI: v0.11.0+

Recommended:

  • VRAM: 8GB+ (perfect for all workflows)

  • RAM: 32GB

  • Storage: 50GB+ (for all models)

Tested Hardware:

  • RTX 4060 8GB @ 1920×1088

  • All FP8, FP16, and BF16 versions work perfectly


🙏 Credits

Original Model: Tongyi Lab (Alibaba Group)

Text Encoder: Qwen3-4B

ControlNet Union: Alibaba PAI Team

SeedVR2: ByteDance Seed Team

Depth Anything V3: ByteDance Seed Team

Architecture: Single-Stream DiT (6B parameters)

License: Apache 2.0

Workflows: Optimized for ComfyUI with metadata support

Community: Thanks to all testers and contributors!


📊 File Sizes

Main Models:

  • Turbo FP8-AIO: ~10GB

  • Turbo FP16-AIO: ~20GB

  • Turbo BF16-AIO: ~20GB

  • Base FP8-AIO: ~10GB

  • Base FP16-AIO: ~20GB

  • Base BF16-AIO: ~20GB
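
The ~10GB and ~20GB figures are consistent with a simple bytes-per-parameter estimate. The sketch below assumes the all-in-one file holds the 6B-parameter DiT plus a text encoder of roughly 4B parameters (inferred from the name Qwen3-4B), all stored at a single precision, and it ignores the comparatively small VAE. Those are assumptions for illustration, not a statement of how the files are actually packed.

```python
# Bytes used to store one parameter at each precision.
BYTES_PER_PARAM = {"fp8": 1, "fp16": 2, "bf16": 2}


def estimate_size_gb(params_billion: float, precision: str) -> float:
    """Approximate checkpoint size: billions of parameters x bytes each = GB."""
    return params_billion * BYTES_PER_PARAM[precision]


if __name__ == "__main__":
    # Assumed parameter counts: 6B DiT + ~4B text encoder, VAE ignored.
    total_params_billion = 6 + 4
    for precision in ("fp8", "fp16", "bf16"):
        print(precision, estimate_size_gb(total_params_billion, precision), "GB")
```

Under these assumptions FP8 comes out at about 10 GB and FP16/BF16 at about 20 GB, matching the sizes listed above.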

Additional Files:

  • ControlNet Union: ~2.5GB

  • SeedVR2 models: 10-20GB (3B-7B variants)

  • Depth Anything V3: 80MB-1.15GB (model dependent)

Total (all workflows): ~70GB for complete setup


🎯 Getting Started

1️⃣ Download model (FP8, FP16, or BF16 - Turbo or Base)

2️⃣ Install base custom nodes (rgthree + metadata)

3️⃣ Choose workflow based on your needs

4️⃣ Install workflow-specific nodes if needed

5️⃣ Load workflow into ComfyUI v0.11.0+

6️⃣ Generate!


Updated: January 2026

Tested: RTX 4060 8GB @ 1920×1088

ComfyUI: v0.11.0+ required


Eight powerful workflows for every creative need! 🚀

Turbo for speed | Base for control | Choose what fits your workflow!
