CHILI - because of sharp focus on main subject and photorealistic rendering of body details.

MANGO - because of sweet, lovely, bokehlicious scene composition with outstanding separation of main subject.

I tried to train it hard on artistic beautiful female anatomy with keeping it creative enough to render interesting intricate scenery with bokeh.

FEEDBACKs are WELCOME!

Prompts suggestions:

"swirly bokeh, smooth bokeh, depth of field, analog film grain, vintage lens" - to amplify the separation of main subject and lifelike look of the picture
"realistic skin texture, natural skin, perfect anatomy, anatomically correct body, perfect body, anatomically correct fingers, perfect face anatomy, anatomically correct eyes, ideal round cornea" - to increase the chance to avoid anatomical defects and amplify the realism of human body
Use DPM++ 2S a Karras sample - it is more stable. Other samplers with Karras scheduler also good. UniPC shows itself unstable but 2x faster and sometimes also gives satisfying results. Euler a - not tested (should give performance comparable with UniPC, but UniPC usually better). DDIM - sometimes good in ADetailer or HighRes Fix. Heun also useful for HR Fix and ADetailer only.
Use HighRes Fix with Latent upscalers if you want to add much more details. Use Img2Img based upcalers like UltraSharp 4x if you want to minimize aliasing issues. I prefer Latent scalers with Antialiasing. Latent upscaling can inject absent details into low-res picture. I2I upscalers aren't smart enough to enrich the 1st pass img.
Use AfterDetailer. In most cases you can be satisfied by only one ADetailer pass with "mediapipe_face_mesh_eyes_only". Set bigger "Inpaint mask blure" (I use 24px) and "Inpaint only masked padding" (I use 128). Use 1280 resolution for this pass and ADetailer will give you outstanding face and eyes quality.
Use "temporal trick" [prompt a : prompt b : nn] to specify different Loras for different stages of rendering. Thi way you can use body related Loras on early steps and eyes / skin details related Loras on later steps.

Intro

I made it just for fun and as experiment to build a model good for augmenting professional photographs. I am using Nikon camera with bunch of vintage lens. I expect to build an SD model which is able to produce moody, cinematic pictures with nice smooth bokeh and "analog style". Please note, that I don't plan to train this model on any hardcore nsfw. Don't expect / request it from "cinero" models ;) My preference is art, beauty and emotions.

Some tips on Prompting

Few examples:

"[grayscale : [dimmed colors : vibrant color splashes : 16] : 8]" - I call it "temporal trick". What it does is just make your prompt depending on current step. With this prompts SD will use "grayscale" on steps 1..7. SD will use "dimmed colors" on steps 8..15. On further steps SD will use "vibrant color splashes". I believe that there is no strict limits on nesting level. What you can do with it? You can effectively reduce the number of tokens SD process on each step (reduce the length of active prompt). On the first steps there is no sens to specify fine details. You only need to specify the scenery roughly. On the later steps there is no sense to spend tokens on describing the composition and lighting (I suspect). So, in theory, with this trick and big number of steps you can keep your prompt short and have build very rich prompt at the same time.
PS: this prompt above force the SD to draw the scene with very little colors and super vibrant segments (I showed grayscale images where a subject has few vibrant hair curls or clothes parts). Probably, you can reverse this effect by making whole picture colored with some parts made grayscale.
[Audrey Hepburn : Milla Jovovich : 16] - you can have fun with smooth transition from one face to another with XYZ plot script in Automatic1111. Also, this particular temporal trick with face / body helps my model to render most realistic and correct anatomy. I suspect you can also implement dynamic LoRa weighting with this trick. If LoRa don't have a trigger word you can just put the LoRa token like [ <lora: ...:0.42> : <lora: ...:0.99> : 16] or you can use multiple levels of nested "trigger words" from different loras.
"shot on %Brand Name% %Lens Mark Name% vintage lens" - if you find the vintage lens names which SD have in its memory, then you have a chance to improve an "analog style" of your picture. I used to use "Carl Zeiss Sonar", "Nokton", "Helios 44-2", but cannot confirm that each particular lens model gives unique effect. If you have you own list of confirmed lens models, then please consider to share it with community in comments to this model [%PICTURE OF LEELOO saying HELP%]
In near future I plan to build a training dataset with many images shot on beautiful vintage lenses to bring old-school photography soul into this model. I will use some unique trigger word for that or will use a "vintage lens" (not sure yet).
use "perfect anatomy", "anatomically correct body", "anatomically correct hand", "perfect hands", "anatomically correct fingers", "perfect limbs anatomy" and similar anatomical phrases to increase the chance to get correct anatomy.
Us words smooth bokeh, swirly bokeh, depth of field, smooth background to increase the separation of main subject and scenery.
Use "turbulent fog", "mist" and "haze" with "mystical lighting" to get nice atmospheric picture with super noticeable depth of scene. Also use "early morning" and "blue hour" phrases if you want to get cold morning vibes.
Use "scary face expression", "surprised expression", "inviting expression", "lustful face", etc to increase the chance to get noticeable emotions on face and visible "body language". It works, but not yet very noticeable.

Priorities of this model

Cinematic photo-realistic pictures of female character (sfw, softcore nsfw)
Natural body, skin texture, [to be improved] environment (dirt, dust, stuff on floor, retro furniture and devices)
Realistic optical / photo effects (smooth swirly bokeh, analog film grain, aberrations [in progress]) of vintage lenses (Carl Zeiss Sonar, Jupiter 37a, Helios 44-2)
[To be improved] Urbex, abandoned, decaying interiors, depressive vibes, dimmed colors, fog, mist, vapor

How it was created

It is based on few merges of Analog Madness, URPM, Cyber Realistic, epiCRealism, ICBINP, Cine Diffusion with coefficients in 0.18..0.35.

It was trained with two datasets of carefully selected art photos with similar features (cinematic mood, atmospheric, charming anatomy, soft core / ero, retro interiors, morning outdoors, etc.). Total number of images in datasets: 600-700.

Trained as LoRa with 20 steps per image using Kohya_SS then merged with coefficient ~0.3 into Merge of mentioned Checkpoints. Better to use with my LoRa with the same name to amplify the effect.

Further improvements

By priority:

[done] Fix / Improve hand and fingers generation
[in progress] Improve gloom, bokeh, chromatic aberrations, spherical aberrations, light leaks and old analog film features
Fix / Improve feet and toes generation
[in progress] Add more urbex, abandoned, vandalized interiors and lost / forgotten outdoor scenery (suggest me good datasets pls ;)
Fine tuning / improvements of eyes and anatomy

Feedback appreciated...

모델 유형	체크포인트
기본 모델	SD 1.5
게시일	2023-09-01

CinEro_SD15

세부 정보

파일 다운로드 (1)

이 버전에 대해

모델 설명

Intro

Some tips on Prompting

Priorities of this model

How it was created

Further improvements

이 모델로 만든 이미지