Z-Image Turbo FP8 [Kijai]

Details

Model description

Text Encoder | VAE

If you're already using SDXL, Flux, or Qwen-based generators, here's the simple version of what Z-Image-Turbo is:

It's a super-fast text-to-image model from Alibaba that spits out 1024×1024 pictures in less than a second on a single high-end card (or in a couple of seconds on a 3090/4090). The speed comes from heavy distillation: it's basically a 6B model taught by much bigger internal beasts, so it feels closer to closed top-tier models than to most open-source stuff.

Compared to what you know:

  • Faster than Flux and SDXL (though SD3.5 Turbo is a bit faster in my testing)

  • Better prompt following and prettier results than most current open models

  • Renders English and Chinese text in images almost perfectly

  • Currently sitting at #1 on the public human-voted leaderboard (AI Arena Elo)

Tongyi-MAI HF | GitHub

The model is mirrored here for convenience.
