LoRA Dataset Caption Aid with Ollama Vision Uncensored

This workflow is used to get a starting point when creating captions for a character LoRA. You feed it an image and it will analyze the image and provide a well rounded description aimed for Character LoRA that should include all the relevant details required for proper captioning.

I found a good Ollama model with vision capability that isn't censored and fed it with a detailed prompt on how to craft the proper caption for a Character LoRA. The Ollama model is about 7GB and should fit on almost any GPU if ran on its own.

An additional "hint" box is added, so the user can provide a hint for the Vision model for exceptional situation like extreme close-up where the image analysis can get wrong if the AI isn't provided with at least minimal context. Leave that box empty in most cases.

I have also added a few markdown notes with tips for the best possible captioning.

모델 유형	워크플로우
기본 모델	Flux.1 D
게시일	2025-08-10

LoRA Dataset Caption Aid with Ollama Vision Uncensored

세부 정보

파일 다운로드 (1)

이 버전에 대해

모델 설명

이 모델로 만든 이미지