img2prompt

img2prompt is a ComfyUI workflow that generates detailed captions and prompts from input images using several captioning and tagging models. The generated prompts are saved to a file for later review and reuse.

This workflow integrates the following models:

WD14
MiaoshouAI
JoyCaption 2
Florence

Features

Converts input images into detailed and accurate captions or prompts.
Utilizes multiple popular image captioning models for diverse output styles.
Saves all generated prompts into a file for easy access and future use.

How to Use

Load an input image in ComfyUI.
Run the img2prompt workflow.
The workflow will process the image using the integrated models and generate prompts/captions.
All generated prompts are automatically saved to a file for later use.
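
The steps above can also be scripted rather than run from the interface. The following is a minimal sketch, not part of the workflow itself. It assumes that the workflow has been exported from ComfyUI in API format, that a ComfyUI server is listening at the default local address http://127.0.0.1:8188, and that the input image is already in ComfyUI's input folder. The function name build_queue_request is hypothetical.

```python
import copy
import json
import urllib.request


def build_queue_request(workflow, image_name, server="http://127.0.0.1:8188"):
    """Return a POST request that queues `workflow` with `image_name` as input.

    Assumption: `workflow` is an API-format export, i.e. a dict that maps
    node ids to {"class_type": ..., "inputs": {...}}.
    """
    wf = copy.deepcopy(workflow)  # leave the caller's dict untouched
    for node in wf.values():
        # Point every LoadImage node at the requested input image.
        if node.get("class_type") == "LoadImage":
            node["inputs"]["image"] = image_name
    body = json.dumps({"prompt": wf}).encode("utf-8")
    return urllib.request.Request(
        f"{server}/prompt",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

To queue a run, load the exported JSON with json.load, pass it to build_queue_request together with the image file name, and send the result with urllib.request.urlopen. Building the request separately from sending it makes it possible to inspect the payload without a running server.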

Output

The generated prompts are saved in a structured text file. This allows you to:

Review and edit the prompts.
Use the prompts in other workflows or tools.
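
As an illustration of reusing the saved prompts in another tool, here is a minimal sketch. The exact layout of the output file is not specified here, so the sketch assumes a UTF-8 text file with one prompt per line; adjust the parsing if your file is structured differently. The function names parse_prompts and load_prompts are hypothetical.

```python
from pathlib import Path


def parse_prompts(text):
    """Split saved text into prompts, dropping blanks and exact duplicates."""
    seen = set()
    prompts = []
    for line in text.splitlines():
        prompt = line.strip()
        # Keep the first occurrence of each non-empty prompt, in file order.
        if prompt and prompt not in seen:
            seen.add(prompt)
            prompts.append(prompt)
    return prompts


def load_prompts(path):
    """Read a saved prompt file (assumed: UTF-8, one prompt per line)."""
    return parse_prompts(Path(path).read_text(encoding="utf-8"))
```

Call load_prompts with the path to the saved file to get a list of strings that can be passed to another workflow or script.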

Models Overview

WD14: A tagger that outputs descriptive tags and keywords rather than full sentences.
MiaoshouAI: Generates natural, context-aware captions.
JoyCaption 2: Produces long, descriptive captions with attention to visual style.
Florence: A versatile captioning model that produces detailed scene descriptions.

Acknowledgments

Special thanks to the creators of ComfyUI and the developers of the integrated models.

Enjoy generating amazing image prompts with img2prompt!

Model Type	Workflows
Base Model	Other
Published	2025-06-09
