Z-Image-Turbo/Base-AIO


🚀 Z-Image AIO Collection

⚡ Base & Turbo • All-in-One • Bilingual Text • Qwen3-4B


⚠️ IMPORTANT: Requires ComfyUI v0.11.0+

📥 Download ComfyUI


✨ What is Z-Image AIO?

Z-Image AIO is an all-in-one repackage of Alibaba Tongyi Lab's 6B-parameter image generation models, Z-Image-Base and Z-Image-Turbo.

Everything is integrated:

  • ✅ VAE built in

  • ✅ Qwen3-4B text encoder integrated

  • ✅ Just download and generate!


🎯 Available Versions


🔥 Z-Image-Turbo-AIO (8 Steps • CFG 1.0)

Ultra-fast generation for production & daily use


⚫ NVFP4-AIO (7.8 GB) 🆕

🎯 ONLY for NVIDIA Blackwell GPUs (RTX 50xx)!
⚡ Optimized for maximum speed
💾 Smallest file size
🚀 FP4 precision - blazing fast

Perfect for: RTX 5070, 5080, 5090 owners who want maximum speed


🟡 FP8-AIO (10 GB) ⭐ RECOMMENDED

✅ Best balance of size & quality
✅ Works on 8GB VRAM
✅ Fast downloads
✅ Ideal for most users

Perfect for: Daily use, testing, RTX 3060/4060/4070


🔵 FP16-AIO (20 GB)

💾 Same file size as BF16
🔄 ComfyUI auto-casts to BF16 for compute
⚠️ Does NOT enable FP16 compute mode
📦 Alternative download option

Note: Z-Image does not support FP16 compute. Its activation values exceed FP16's maximum representable value, which produces NaN values and black images. Weights are cast to BF16 during inference regardless of file format.

Perfect for: Alternative to BF16 download (identical inference behavior)


🌟 BF16-AIO (20 GB) ⭐ RECOMMENDED FOR FULL PRECISION

✅ BFloat16 full precision
✅ Absolute best quality
✅ Professional projects
✅ Also works on 8GB VRAM

Perfect for: Professional work, maximum quality


🎨 Z-Image-Base-AIO (28-50 Steps • CFG 3-5)

Full creative control for pros & LoRA training


🟡 FP8-AIO (10 GB)

✅ Efficient for daily use
✅ Full CFG control
✅ Negative prompts supported
✅ 8GB VRAM compatible

Perfect for: Daily work with full control


🔵 FP16-AIO (20 GB)

💾 Same file size as BF16
🔄 ComfyUI auto-casts to BF16 for compute
⚠️ Does NOT enable FP16 compute mode
📦 Alternative download option

Note: See the technical explanation in the FAQ below.

Perfect for: Alternative to BF16 download (identical inference behavior)


🌟 BF16-AIO (20 GB) ⭐ RECOMMENDED FOR FULL PRECISION

✅ Maximum quality
✅ Ideal for LoRA training
✅ Professional projects
✅ Highest precision

Perfect for: LoRA training, professional work


🆚 Turbo vs Base - When to Use?


⚡ Use TURBO when:

⚡ Speed is the priority → 8 steps = 3-10 seconds
📸 Production workflows → Consistent high quality
💾 Quick iterations → Rapid prototyping
🎯 Simple prompts → Less complex scenes

🎨 Use BASE when:

🎨 Creative exploration → Higher diversity
🔧 LoRA/ControlNet dev → Undistilled foundation
📝 Complex prompting → Full CFG control
🚫 Negative prompts needed → Remove unwanted elements

⚙️ Recommended Settings


⚡ Turbo Settings (incl. NVFP4)

📊 Steps: 8
🎚️ CFG: 1.0 (don't change!)
🎲 Sampler: res_multistep OR euler_ancestral
📈 Scheduler: simple OR beta
📐 Resolution: 1920×1088 (recommended)
🚫 Negative Prompt: ❌ Not used!

🎨 Base Settings

📊 Steps: 28-50
🎚️ CFG: 3.0-5.0 (start with 4.0)
🎲 Sampler: euler ⭐ OR dpmpp_2m
📈 Scheduler: normal ⭐ OR karras
📐 Resolution: 512×512 to 2048×2048
🚫 Negative Prompt: ✅ Fully supported!
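
If you script your generations, the settings above can be kept as two presets. This is only a sketch: `SamplerPreset` and `preset_for` are hypothetical helper names, the field names are chosen to resemble the KSampler inputs in ComfyUI, and the Base values (28 steps, 1024×1024) are assumptions picked from inside the recommended ranges.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SamplerPreset:
    """Sampler settings for one model family (hypothetical helper type)."""
    steps: int
    cfg: float
    sampler_name: str
    scheduler: str
    width: int
    height: int
    use_negative_prompt: bool


# Turbo: fixed 8 steps at CFG 1.0, no negative prompt.
TURBO = SamplerPreset(
    steps=8, cfg=1.0, sampler_name="res_multistep", scheduler="simple",
    width=1920, height=1088, use_negative_prompt=False,
)

# Base: 28-50 steps, CFG 3.0-5.0. The low end of the step range and a
# 1024x1024 canvas are assumed starting points, not values from the card.
BASE = SamplerPreset(
    steps=28, cfg=4.0, sampler_name="euler", scheduler="normal",
    width=1024, height=1024, use_negative_prompt=True,
)


def preset_for(checkpoint_name: str) -> SamplerPreset:
    """Pick a preset from the file name; anything without '-Base-' is Turbo."""
    return BASE if "-base-" in checkpoint_name.lower() else TURBO
```

For example, `preset_for("Z-Image-Base-BF16-AIO.safetensors")` returns the Base preset, while any Turbo file name (including NVFP4) falls through to the Turbo preset.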

📊 Quick Overview


Turbo Versions

⚫ NVFP4 │ 7.8 GB  │ RTX 50xx only  │ Max Speed 🆕
🟡 FP8   │ 10 GB   │ 8GB VRAM       │ Recommended ⭐
🔵 FP16  │ 20 GB   │ → BF16 compute │ See FAQ ⚠️
🌟 BF16  │ 20 GB   │ 8GB VRAM       │ Max Quality ⭐

Base Versions

🟡 FP8   │ 10 GB   │ 8GB VRAM       │ Efficient
🔵 FP16  │ 20 GB   │ → BF16 compute │ See FAQ ⚠️
🌟 BF16  │ 20 GB   │ 8GB VRAM       │ LoRA Training ⭐

💡 Prompting Guide


✅ Good Example:

Professional food photography of artisan breakfast plate. 
Golden poached eggs on sourdough toast, crispy bacon, fresh 
avocado slices. Morning sunlight creating warm glow. Shallow 
depth of field, magazine-quality presentation.

❌ Bad Example:

breakfast, eggs, bacon, toast, food, morning, plate

📝 Tips

DO:

  • ✅ Use natural language

  • ✅ Be detailed (100-300 words)

  • ✅ Describe lighting & mood

  • ✅ Specify camera angle

  • ✅ English OR Chinese (or both!)

DON'T:

  • ❌ Tag-style prompts (tag1, tag2, tag3)

  • ❌ Very short prompts (under 50 words)

  • ❌ Negative prompts with Turbo
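
The DO/DON'T list above can be turned into a rough pre-flight check. The sketch below is a hypothetical helper (`lint_prompt` is not part of ComfyUI or Z-Image). It counts whitespace-separated words, so it only makes sense for English prompts, and the tag-style test (four or more comma-separated fragments of at most two words each) is an assumed heuristic.

```python
def lint_prompt(prompt: str, turbo: bool = True, negative_prompt: str = "") -> list[str]:
    """Return warning labels for a prompt, following the tips above."""
    warnings = []

    # Word-count thresholds come from the tips (under 50 is too short,
    # 100-300 is the suggested range); whitespace splitting is English-only.
    word_count = len(prompt.split())
    if word_count < 50:
        warnings.append("short")
    elif word_count > 300:
        warnings.append("long")

    # Assumed heuristic: many very short comma-separated fragments
    # look like "tag1, tag2, tag3" rather than natural language.
    fragments = [f.strip() for f in prompt.split(",") if f.strip()]
    if len(fragments) >= 4 and all(len(f.split()) <= 2 for f in fragments):
        warnings.append("tag-style")

    # Turbo runs at CFG 1.0 and does not use a negative prompt.
    if turbo and negative_prompt.strip():
        warnings.append("negative-with-turbo")

    return warnings
```

Running it on the bad example, `lint_prompt("breakfast, eggs, bacon, toast, food, morning, plate")`, flags the prompt as both short and tag-style.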


🌐 Bilingual Text Rendering


English:

Neon sign reading "OPEN 24/7" in bright blue letters 
above entrance. Modern sans-serif font, glowing effect.

Chinese:

Traditional tea house entrance with sign reading 
"古韵茶坊" in elegant gold Chinese calligraphy.

Both:

Modern cafe with bilingual sign. "Morning Brew" in 
white script above, "晨曦咖啡" in Chinese below.

📥 Installation


Step 1: Download

Choose your version based on:

  • GPU: RTX 50xx → NVFP4 possible

  • VRAM: 8GB → FP8 recommended

  • Purpose: LoRA Training → Base BF16
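
The selection rules can be written down as a small function. This is a sketch under assumptions: `choose_variant` and `is_blackwell` are hypothetical names, the `purpose` labels are invented for illustration, and the GPU check is a plain name match on "RTX 50xx" rather than a real hardware query.

```python
import re


def is_blackwell(gpu_name: str) -> bool:
    """Heuristic name match for consumer 'RTX 50xx' cards.

    Assumption: this only looks at the marketing name, so workstation
    cards with similar numbering could be misclassified.
    """
    return re.search(r"RTX\s*50\d{2}", gpu_name, re.IGNORECASE) is not None


def choose_variant(gpu_name: str, purpose: str = "general") -> str:
    """Map a GPU name and a purpose label to one of the AIO file names."""
    if purpose == "lora_training":
        return "Z-Image-Base-BF16-AIO"
    if purpose == "max_quality":
        return "Z-Image-Turbo-BF16-AIO"
    if purpose == "max_speed" and is_blackwell(gpu_name):
        return "Z-Image-Turbo-NVFP4-AIO"
    # FP8 is the recommended default, including for RTX 40xx and older.
    return "Z-Image-Turbo-FP8-AIO"
```

With these rules, an RTX 4090 asking for maximum speed still gets the FP8 build, because NVFP4 is limited to RTX 50xx.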


Step 2: Place File

ComfyUI/models/checkpoints/
└── Z-Image-Turbo-FP8-AIO.safetensors

Step 3: Load & Generate

  1. Open ComfyUI (v0.11.0+!)

  2. Use "Load Checkpoint" node

  3. Select your AIO version

  4. Generate!

No separate VAE or Text Encoder needed!


🙏 Credits


Original Model

👨‍💻 Developer: Tongyi Lab (Alibaba Group)
🏗️ Architecture: Single-Stream DiT (6B parameters)
📜 License: Apache 2.0

Links

🔗 Z-Image Base: https://huggingface.co/Tongyi-MAI/Z-Image

🔗 Z-Image Turbo: https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

🧠 Text Encoder: https://huggingface.co/Qwen/Qwen3-4B


📈 Version History


v2.2 - FP16 Clarification

📝 Updated FP16 descriptions for technical accuracy
⚠️ Clarified: FP16 weights ≠ FP16 compute
🔄 FP16 files are cast to BF16 during inference

v2.1 - NVFP4 Release 🆕

➕ Z-Image-Turbo-NVFP4-AIO (7.8 GB)
⚡ Optimized for NVIDIA Blackwell (RTX 50xx)
🚀 Maximum speed generation

v2.0 - Base AIO Release

➕ Z-Image-Base-BF16-AIO
➕ Z-Image-Base-FP16-AIO
➕ Z-Image-Base-FP8-AIO
🔄 ComfyUI v0.11.0+ support
📝 Qwen3-4B Text Encoder

v1.1 - FP16 Added

➕ Z-Image-Turbo-FP16-AIO
🔧 Wider GPU compatibility

v1.0 - Initial Release

✅ Z-Image-Turbo-FP8-AIO
✅ Z-Image-Turbo-BF16-AIO
✅ Integrated VAE + Text Encoder

❓ FAQ


Q: Which version should I choose?

RTX 50xx + Speed → NVFP4 🆕
Most users       → Turbo FP8 ⭐
Full precision   → BF16 ⭐
LoRA Training    → Base BF16

Q: Turbo or Base?

Fast & simple    → Turbo ⚡
Full control     → Base 🎨

Q: Will NVFP4 work on my RTX 4090?

❌ No! NVFP4 is only for RTX 50xx (Blackwell architecture).

Use FP8 instead for RTX 40xx and older.


Q: Do I need a separate VAE/Text Encoder?

❌ No! Everything is already integrated.

Just Load Checkpoint and go!


Q: Does it work on 8GB VRAM?

✅ Yes! All versions work on 8GB VRAM.

(NVFP4 requires RTX 50xx regardless of VRAM)


⚠️ Q: What about FP16 for older GPUs (RTX 20xx/30xx)?

Important technical clarification:

Z-Image does NOT support the FP16 compute type. Here's why:

📊 Technical reason:
- FP16 max value: 65,504
- BF16 max value: ~3.39e+38 (nearly the same range as FP32, ~3.40e+38)
- Z-Image's activation values exceed FP16's range
- Result: Overflow → NaN → Black images
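
The range difference is easy to see with the Python standard library alone. The sketch below is an illustration, not how ComfyUI does it: `fits_fp16` and `to_bf16` are hypothetical helpers, the activation value of 70000 is made up, and BF16 is emulated by keeping only the top 16 bits of a float32 (truncation, not the rounding real hardware uses).

```python
import math
import struct

FP16_MAX = 65504.0  # largest finite IEEE 754 half-precision value


def fits_fp16(x: float) -> bool:
    """True if x can be stored as a finite FP16 value ('e' struct format)."""
    try:
        struct.pack("<e", x)
        return True
    except OverflowError:
        return False


def to_bf16(x: float) -> float:
    """Emulate BF16 by truncating a float32 to its upper 16 bits."""
    bits = struct.unpack(">I", struct.pack(">f", x))[0]
    return struct.unpack(">f", struct.pack(">I", bits & 0xFFFF0000))[0]


activation = 70000.0  # hypothetical activation just above the FP16 limit

print(fits_fp16(FP16_MAX))    # the FP16 maximum itself is representable
print(fits_fp16(activation))  # anything larger overflows in FP16
print(to_bf16(activation))    # BF16 keeps the magnitude, with less precision
print(math.isnan(math.inf - math.inf))  # overflowed values turn into NaN
```

The last line shows why an overflow ends in black images: once a value becomes infinity, ordinary arithmetic on it (such as inf - inf) yields NaN, and NaN spreads through every later operation.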

What actually happens:

  • ComfyUI automatically casts weights to BF16 for computation

  • You can see this in the logs: "model weight dtype X, manual cast: torch.bfloat16"

  • "Weight dtype" (file format) ≠ "Compute dtype" (actual calculation)
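
If you want to check this on your own machine, you can pull both dtypes out of the log line. This is a sketch that assumes the log format quoted above; `parse_dtypes` is a hypothetical helper, and the sample line fills in the "X" placeholder with `torch.float16` purely for illustration.

```python
import re

# Pattern built from the log message quoted above (format is an assumption).
LOG_PATTERN = re.compile(r"model weight dtype (\S+), manual cast: (\S+)")


def parse_dtypes(line: str):
    """Return (weight_dtype, compute_dtype) from a log line, or None."""
    match = LOG_PATTERN.search(line)
    if match is None:
        return None
    weight_dtype, compute_dtype = match.groups()
    return weight_dtype, compute_dtype


# Illustrative sample line, not copied from a real run.
sample = "model weight dtype torch.float16, manual cast: torch.bfloat16"
print(parse_dtypes(sample))
```

When the two values differ, the file format on disk is not the dtype used for the actual computation.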

For RTX 20xx users (no native BF16):

  • BF16 is emulated via FP32, which is slower but works

  • There is no way to run Z-Image in true FP16 compute

  • FP8 with CPU offload may be a better option for limited VRAM

TL;DR: FP16 and BF16 files behave identically during inference. Choose based on download preference, not GPU compatibility.


🚀 Get Started Now!

Download → Load Checkpoint → Generate!

Recommended versions:

  • 🟡 FP8 for most users (best size/quality balance)

  • 🌟 BF16 for maximum quality

  • ⚫ NVFP4 for RTX 50xx speed

All versions work on 8GB VRAM


Happy generating! 🎨
