Friendly Z-Image-Turbo
Details
Download Files
About this version
Model description
Welcome to my 💫🌄 Friendly Z-Image-Turbo
✨ Less mess, more magic
Z-Image-Turbo - everyone already knows it's overkill!)
Z-Image is a 6B parameter efficient image generation foundation model with new Scalable Single-Stream DiT (S3-DiT) architecture where text, visual semantic tokens, and image VAE tokens are concatenated at the sequence level to serve as a unified input stream, maximizing parameter efficiency. It has a great understanding of prompts, styles and has a high attention to detail.
🚀 Z-Image-Turbo has a speed and quality balance advantage over other local models such as Flux 1, Flux 2, Qwen and others!
I offer my workflow, with which you can generate on a full BF16 model without sacrificing quality and at the same speed. This requires at least 6GB of video memory and at least 32GB of RAM. No Tritons or SageAttention! Simply configure the required amount of video memory dumped to RAM in the "VRAM Optimizer+" node.
The latest version of ComfyUI is required.
💻 System requirements:
- Minimum system requirements:
RTX 3000-s, 6GB VRAM, 32GB+ RAM, 8-core processor, SSD, latest ComfyUI
📌 Detailed tips and links in the workflow
✨ Workflow features:
Extremely user-friendly interface
Maximum performance even on low-end systems with at least 6GB of VRAM
Support for up to 2 Lora
Detailed tooltips, including recommended samplers for best quality
Manual random seed for complete control over generations
🤗🙏🏼 Thanks to Tongyi-MAI
Original repo — GitHub









