Ovis-Image GGUF Text-to-Image Workflow by Sarcastic TOFU

This is a very simple ComfyUI beginner friendly Text-to-Image workflow that will work with a single Ovis-Image GGUF models running on relatively entry level GPUs (8 to 12 GB VRAM for Nvidia/AMD, 16 GB Base Unified Memory on Apple Silicon Mac with M series Processors like M3, M4, M5 etc.). Ovis-Image, Released in late November 2025, is an open-source 7-billion-parameter text-to-image generation model developed by the AIDC-AI team at Alibaba (Alibaba International Digital Commerce Group). It delivers legible, correctly spelled, and semantically consistent text across diverse fonts, sizes, layouts, and aspect ratios in English or Chinese text. Also, Ovis-Image is specifically optimized for high-quality text rendering in generated images, making it ideal for text-heavy prompts such as posters, banners, logos, UI mockups, infographics, social media graphics, and marketing materials. This model achieves text rendering quality comparable to much larger models (e.g., 20B+ class like Qwen-Image) and competitive with closed-source systems like GPT-4o or Seedream. In my experience it can sometimes generate AI Photoes similar to Z-Image Turbo model but it works better on simpler outputs that deals with text-heavy prompts. This is definately worth trying simply for this!

How to use this -

#1. Just select your desired Ovis-Image GGUF model first and then

#2. select image output dimensions to start

#3. then input your positive and negative prompts.

#4. select how many images you want (Change the number besides the "Run" button)

#5. now set sampling methods, CFG, steps etc. settings and any other optional settings

#6. finally press the run button to generate. That's it..

Enjoy!

## Required Models

======================

### Download Links for Ovis-Image GGUF Checkpoint -

https://huggingface.co/convertor/ovis-image-gguf/resolve/main/ovis-image-iq4_nl.gguf

### Download Links for Ovis-Image GGUF Encoder -

https://huggingface.co/convertor/ovis-image-gguf/resolve/main/qwen3_vl_2b_f32-iq4_nl.gguf

### Download Links for Ovis-Image GGUF VAE (This is just Flux GGUF VAE) -

https://huggingface.co/convertor/ovis-image-gguf/resolve/main/pig_flux_vae_fp32-f16.gguf

模型类型	工作流
基础模型	Other
发布时间	2025-12-19

Ovis-Image GGUF Text-to-Image Workflow by Sarcastic TOFU

详情

下载文件 (1)

模型描述

此模型生成的图像