img2caption

Details

Download Files

Model description

img2prompt

img2prompt is a workflow for ComfyUI that allows users to generate detailed image captions and prompts from input images using state-of-the-art models. The generated prompts are saved to a file for later review and reuse.

This workflow integrates the following models:

  1. WD14

  2. MiaoshouAI

  3. JoyCaption 2

  4. Florence

Features

  • Converts input images into detailed and accurate captions or prompts.

  • Utilizes multiple popular image captioning models for diverse output styles.

  • Saves all generated prompts into a file for easy access and future use.

How to Use

  1. Load an input image in ComfyUI.

  2. Run the img2prompt workflow.

  3. The workflow will process the image using the integrated models and generate prompts/captions.

  4. All generated prompts are automatically saved to a file for later use.

Output

The generated prompts are saved in a structured text file. This allows you to:

  • Review and edit the prompts.

  • Use the prompts in other workflows or tools.

Models Overview

  • WD14: A powerful model for generating descriptive tags and keywords.

  • MiaoshouAI: Known for generating natural and context-aware captions.

  • JoyCaption 2: Provides highly descriptive and aesthetic captions.

  • Florence: A versatile captioning model that excels in creating detailed scene descriptions.

Acknowledgments

Special thanks to the creators of ComfyUI and the developers of the integrated models:

Screenshots


Enjoy generating amazing image prompts with img2prompt!

Images made by this model

No Images Found.