ERNIE-AIO
μΈλΆ μ 보
νμΌ λ€μ΄λ‘λ (2)
μ΄ λ²μ μ λν΄
λͺ¨λΈ μ€λͺ
π§ ERNIE-Image AIO for ComfyUI
All-in-One Checkpoints β’ BF16 & FP8 β’ Text Encoder + VAE Integrated β’ Simple ComfyUI Setup
β¨ What is this?
This release provides AIO (All-in-One) checkpoint versions of ERNIE-Image and ERNIE-Image-Turbo for ComfyUI.
Instead of loading separate model parts, these AIO versions already include the image model, text encoder, and VAE inside a single checkpoint file. Just place the model into your checkpoint folder, load it with the regular Checkpoint Loader node, and build a very small workflow around it.
The original ERNIE-Image family from Baidu focuses on strong instruction following, good text rendering, structured compositions, and support for a broad range of styles, from design-oriented images to more realistic or stylized outputs.


huggingface.co Releases
Workflow Releases
π¦ Included Versions
This release contains AIO checkpoints for:
ERNIE-Base-AIO
ERNIE-Turbo-AIO
Each version is provided in:
BF16
FP8
βοΈ AIO Format
These checkpoints are made for a simple ComfyUI workflow.
Install path
Place the .safetensors files here:
\ComfyUI\models\checkpoints
Then load them with:
Checkpoint Loader
CLIP Text Encode (Prompt)
Empty Latent Image
KSampler
VAE Decode
Save Image
That is all you theoretically need to run the model.
π§ͺ Quantization Notes
The FP8 versions in this release were quantized by me.
For these FP8 AIO builds:
Image model: FP8
Text encoder: FP8
VAE: FP32
This setup was chosen to reduce size and improve practicality while keeping decoding stable and reliable.
πΌοΈ Recommended Usage
ERNIE-Base-AIO
Best for: general-purpose generation, stronger prompt fidelity, more deliberate results
Recommended CFG: 4.0
Recommended steps: 50
ERNIE-Turbo-AIO
Best for: fast generation, quick iteration, efficient workflows
Recommended CFG: 1.0
Recommended steps: 8
The official ERNIE-Image documentation recommends the same core resolutions for both releases, including:
1024Γ1024
848Γ1264
1264Γ848
768Γ1376
896Γ1200
1376Γ768
1200Γ896
π§Ύ Prompting
ERNIE models work best with natural language prompts.
Describe:
subject
scene
composition
lighting
mood
camera framing
details you actually want in the image
These models are especially interesting for:
posters
text-heavy designs
comics
structured layouts
prompt-sensitive image generation
π§ ComfyUI Notes
These AIO checkpoints are designed for a checkpoint-style workflow, so you do not need to manually load separate text encoder or VAE files.
A workflow for this release will also be uploaded separately.
π Credits
Huge thanks to the original ERNIE-Image team at Baidu for releasing these models and their research to the community. The original official releases are available here:
Official GitHub Repository
Both official model pages list the release under the Apache-2.0 license.
β€οΈ Final Note
I made these AIO versions to make ERNIE-Image easier to use in ComfyUI β especially for people who prefer the classic single-checkpoint workflow.
Load checkpoint, prompt, sample, decode, save β done.




















