Fixed Perspective View

Details

Download Files

Model description

Please note that the original source of this Lora is CivitAI. (civitai.com/user/worgman). It is available for download for free and should not be published elsewhere.

This is a concept LoRA to manipulate the camera angle. It provides a fixed perspective shot of the image. Intended for NSFW content but is not limited to that. Please do post pictures below, as I'm interested to see what everyone is able to make with this tool.

Note that Adetailer and inpainting small regions does not play well with my perspective LoRA due to the emphasis on camera angle manipulation. My recommendation is to disable the Lora, significantly reduce weight usage, or lower the inpainting denoise to mitigate this.

Adetailer inpainting denoise of 0.25-0.30 mostly fixes this at the cost of a small amount of detail.

Spyview

This is a supplemental concept to the fixed perspective LoRA. It is completely separate, from Fixed Perspective V1 - V3 as of V1.0. This could change, but right now the plan is to keep them separate but continue updates for each within this page as they both are camera manipulation LoRA.

As of Spyview V1 the style impact has been reduced, and there has been more variation in what you can prompt to be in the foreground. V1 is far from perfect, however. Due to how much flexibility I allowed in the LoRA you will run into bad outputs occasionally. Good prompting helps to avoid this.

You do have some control over how far the subjects in the image are from the camera through the use of "out of frame" tags. If you call for a large amount of a subject to be out of frame the LoRA should bring the subject closer to the foreground. Alternatively, calling for Full Body in the prompt should do the opposite as the LoRA needs to output the entire image which makes the subject(s) appear further away.

What does V1 do well?

  • Low style impact compared to most of my V1 LoRA.

  • Very flexible

    • You can prompt for combinations of items within the foreground. Note not everything works, but if you experiment, you'll find some interesting combinations can be done.

What does V1 do okay?

  • The various trigger words are mostly differentiated from one another, but there is some overlap.

  • Sliding Door and Door foreground overlap a fair amount.

  • Door foreground tends to overpower other concepts

  • Window foreground could use some work. I tried to give too much control (indoor and outdoor looking in) and it caused issues.

  • Prompting for material type with door (for example glass door foreground) actually does work, despite no images of glass doors being used in the dataset. I'd like to continue this direction and add that type of flexibility into the LoRA. Will see how that goes over time.

Where does V1 fail?

  • Accuracy of LoRA to capture the prompter's intent is low. Certain tags within your prompt can cause wild variations in the output image. This is good for flexibility but not great for reproducibility of a specific idea. Testing was very difficult with this version as there were times when the images were not what I had anticipated with my prompt.

  • Certain concept tags are too diluted to work well. Object foreground is only about 3% of the overall dataset, so don't expect good results from using it.

  • The "architecture" of what is put in the foreground is nonsense depending on what is prompted. Door foreground usually shows the door at an unrealistic angle, or with multiple doorholes or handles, one on the wall and one on the door.

  • Keyhole foreground causes the model to put a keyhole on the image. I need to change the trigger word for this to fix it. It seems to either go at the very top of the keyhold cutout or gets put on the subject. Annoying, about 40% of the tests I did were fine, but it ruined the output of the ones that it showed up on. You can put keyhole in negative prompt to help some.

Fixed Perspective View

V1

My first attempt at creating a concept LoRA and something I have been collecting images for a while as I come across them. Felt like I had enough to give it a shot and this is the result. I will need to collect more to update it in the future. Requires ADetailer and HiResFix to get good outputs. If prompting more than one subject inpainting will likely be required to fix it up.

V2

I have expanded the size of the dataset used for training and have also categorized a section of the images to attempt to provide trigger words to manipulate the camera further. I was rough with it due to the sheer quantity of images used and limited time to spend pruning and cleaning up the tags. Due to that it could be improved, but I did manage to get part of the control that I wanted.

00345.png

V3

Removed some images and added new sources to training dataset to attempt to have the LoRA have less of a style impact on the output as much as V2 or V1. Toyed around with tag weights to try and add further control. V3 is not strictly better than V2.

Images made by this model

No Images Found.