Intro

I made this model just for fun and as an experiment in building a model suited to augmenting professional photographs. I shoot with a Nikon camera and a bunch of vintage lenses. My goal is an SD model that can produce moody, cinematic pictures with nice smooth bokeh and an "analog style". Please note that I don't plan to train this model on any hardcore NSFW content, so don't expect or request it from "cinero" models ;) My preference is art, beauty and emotions.

Some tips on Prompting

A few examples:

"[grayscale : [dimmed colors : vibrant color splashes : 16] : 8]" - I call this the "temporal trick". It makes the prompt depend on the current sampling step. With this prompt SD uses "grayscale" up to step 8, "dimmed colors" from there up to step 16, and "vibrant color splashes" on all later steps. I believe there is no strict limit on the nesting level. What can you do with it? You can reduce the number of tokens SD processes on each step (that is, shorten the active prompt). On the first steps there is no sense in specifying fine details; you only need to describe the scene roughly. On the later steps there is, I suspect, no sense in spending tokens on composition and lighting. So, in theory, with this trick and a large number of steps you can keep the active prompt short and still build a very rich prompt overall.
PS: the prompt above forces SD to draw the scene with very little color plus a few super vibrant segments (I have shown grayscale images where the subject has a few vibrant hair curls or pieces of clothing). You can probably reverse the effect, making the whole picture colored with some parts in grayscale.
[Audrey Hepburn : Milla Jovovich : 16] - you can have fun with a smooth transition from one face to another using the XYZ plot script in Automatic1111. This particular temporal trick with face / body also helps my model render more realistic and correct anatomy. I suspect you can also implement dynamic LoRA weighting with this trick. If a LoRA doesn't have a trigger word, you can try putting the LoRA tags in directly, like [ <lora: ...:0.42> : <lora: ...:0.99> : 16], or you can use multiple levels of nested "trigger words" from different LoRAs.
"shot on %Brand Name% %Lens Mark Name% vintage lens" - if you find vintage lens names that SD has in its memory, you have a chance to improve the "analog style" of your picture. I have used "Carl Zeiss Sonar", "Nokton" and "Helios 44-2", but I cannot confirm that each particular lens model gives a unique effect. If you have your own list of confirmed lens models, please consider sharing it with the community in the comments to this model [%PICTURE OF LEELOO saying HELP%]
In the near future I plan to build a training dataset with many images shot on beautiful vintage lenses, to bring the soul of old-school photography into this model. I will use either a unique trigger word for that or "vintage lens" (not sure yet).
Use "perfect anatomy", "anatomically correct body", "anatomically correct hand", "perfect hands", "anatomically correct fingers", "perfect limbs anatomy" and similar anatomical phrases to increase the chance of getting correct anatomy.
Use the words "smooth bokeh", "swirly bokeh", "depth of field" and "smooth background" to increase the separation between the main subject and the scenery.
Use "turbulent fog", "mist" and "haze" with "mystical lighting" to get a nice atmospheric picture with very noticeable scene depth. Also use the phrases "early morning" and "blue hour" if you want cold morning vibes.
Use "scary face expression", "surprised expression", "inviting expression", "lustful face", etc. to increase the chance of getting noticeable emotions on the face and visible "body language". It works, but the effect is not yet very noticeable.
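To make the "temporal trick" from the examples above easier to reason about, here is a minimal Python sketch that shows which text is active at a given step. It is not the Automatic1111 parser. It assumes that [from : to : when] with an integer "when" uses "from" up to and including step "when" and "to" afterwards, with steps counted from 1. The function name resolve_prompt is made up for this illustration, and the sketch does not handle <lora:...> tags, whose colons it would split incorrectly.

```python
def resolve_prompt(prompt: str, step: int) -> str:
    """Return the prompt text active at a 1-based sampling step (assumed semantics)."""

    def choose(parts):
        # [from : to : when] with an integer `when`:
        # `from` is used up to step `when`, `to` afterwards (assumption).
        if len(parts) == 3 and parts[2].strip().isdigit():
            when = int(parts[2])
            return (parts[0] if step <= when else parts[1]).strip()
        # Anything else (for example a plain [word]) is passed through unchanged.
        return "[" + ":".join(parts) + "]"

    def parse(i, depth):
        parts = [""]
        while i < len(prompt):
            ch = prompt[i]
            if ch == "[":
                # Resolve the nested block first, then append its active text.
                inner, i = parse(i + 1, depth + 1)
                parts[-1] += inner
            elif ch == "]" and depth > 0:
                return choose(parts), i + 1
            elif ch == ":" and depth > 0:
                parts.append("")
                i += 1
            else:
                parts[-1] += ch
                i += 1
        if depth > 0:
            raise ValueError("unbalanced '[' in prompt")
        return parts[0], i

    return parse(0, 0)[0]


prompt = "[grayscale : [dimmed colors : vibrant color splashes : 16] : 8]"
for step in (1, 8, 9, 16, 17):
    print(step, resolve_prompt(prompt, step))
```

Printing the active text for a few steps like this is a quick way to check that a nested prompt switches where you expect before spending time on a long render.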

Priorities of this model

Cinematic photo-realistic pictures of female characters (sfw, softcore nsfw)
Natural body, skin texture, [to be improved] environment (dirt, dust, stuff on floor, retro furniture and devices)
Realistic optical / photo effects (smooth swirly bokeh, analog film grain, aberrations [in progress]) of vintage lenses (Carl Zeiss Sonar, Jupiter 37a, Helios 44-2)
[To be improved] Urbex, abandoned, decaying interiors, depressive vibes, dimmed colors, fog, mist, vapor

How it was created

It is based on a few merges of Analog Madness, URPM, Cyber Realistic, epiCRealism, ICBINP and Cine Diffusion, with merge coefficients in the range 0.18..0.35.
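The description does not say which merge mode was used or which model each coefficient applies to. As a minimal sketch, assuming a plain weighted sum, (1 - alpha) * A + alpha * B, and using small Python lists in place of real checkpoint tensors, one merge step could look like this. The function name and the toy "state dicts" are hypothetical.

```python
def weighted_sum_merge(base, other, alpha):
    """Blend two toy state dicts key by key: (1 - alpha) * base + alpha * other."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be between 0 and 1")
    merged = {}
    for key, base_values in base.items():
        if key not in other:
            # Keep weights that only the base model has (a choice made for this sketch).
            merged[key] = list(base_values)
            continue
        other_values = other[key]
        if len(other_values) != len(base_values):
            raise ValueError(f"shape mismatch for {key!r}")
        merged[key] = [
            (1.0 - alpha) * b + alpha * o
            for b, o in zip(base_values, other_values)
        ]
    return merged


# Toy example: blend 25% of model B into model A.
model_a = {"layer.weight": [1.0, 2.0]}
model_b = {"layer.weight": [3.0, 6.0]}
print(weighted_sum_merge(model_a, model_b, alpha=0.25))
```

With coefficients in the 0.18..0.35 range, each merge of this kind keeps most of the first model and pulls in a smaller share of the second.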

It was trained on two datasets of carefully selected art photos with similar features (cinematic mood, atmospheric, charming anatomy, soft core / ero, retro interiors, morning outdoors, etc.). Total number of images in the datasets: 600-700.

The training was done as a LoRA with 20 steps per image using Kohya_SS; the LoRA was then merged with a coefficient of ~0.3 into the merge of the checkpoints mentioned above. The model works best together with my LoRA of the same name, which amplifies the effect.
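For readers unfamiliar with what merging a LoRA into a checkpoint means, here is a minimal sketch of the usual formulation, W' = W + strength * scale * (up x down), written with plain nested lists instead of real tensors. It is an illustration under assumptions, not the exact procedure used for this model: the function name, the toy matrices and the "scale" parameter (standing in for the LoRA's alpha / rank factor) are all hypothetical.

```python
def merge_lora_into_weight(weight, up, down, strength=0.3, scale=1.0):
    """Return weight + strength * scale * (up @ down) for plain nested lists."""
    rows, cols, rank = len(weight), len(weight[0]), len(down)
    if (
        len(up) != rows
        or any(len(r) != rank for r in up)
        or any(len(r) != cols for r in down)
    ):
        raise ValueError("shape mismatch between weight, up and down")
    merged = []
    for r in range(rows):
        merged_row = []
        for c in range(cols):
            # One element of the low-rank update (up @ down).
            delta = sum(up[r][k] * down[k][c] for k in range(rank))
            merged_row.append(weight[r][c] + strength * scale * delta)
        merged.append(merged_row)
    return merged


# Toy example: a 2x2 weight with a rank-1 LoRA update.
weight = [[1.0, 0.0], [0.0, 1.0]]
up = [[1.0], [2.0]]        # shape: rows x rank
down = [[10.0, 20.0]]      # shape: rank x cols
print(merge_lora_into_weight(weight, up, down, strength=0.5))
```

A strength around 0.3 means only about a third of the LoRA's learned update is baked into the checkpoint, which is consistent with the advice to load the separate LoRA on top to amplify the effect.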

Further improvements

By priority:

[done] Fix / Improve hand and fingers generation
[in progress] Improve gloom, bokeh, chromatic aberrations, spherical aberrations, light leaks and old analog film features
Fix / Improve feet and toes generation
[in progress] Add more urbex, abandoned, vandalized interiors and lost / forgotten outdoor scenery (please suggest good datasets ;)
Fine tuning / improvements of eyes and anatomy

Feedback appreciated...

Model type	Checkpoint
Base model	SD 1.5
Published	2023-08-17

CinEro_SD15
