Snakebite 2

A completely new approach!

The original Snakebite was an Illustrious model injected with bigASP's compositional blocks. Snakebite 2.0, however, is primarily bigASP - but enhanced with a number of techniques to dramatically improve its textures and aesthetic capabilities.


⚠️ IMPORTANT:

This model uses Flow Matching, so in ComfyUI you must route the model through a ModelSamplingSD3 node (between the checkpoint loader and the sampler) to get correct results.
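
As a rough illustration of that wiring, here is a minimal sketch of the relevant fragment of a ComfyUI API-format workflow, written as a Python dict. The checkpoint filename is a hypothetical placeholder, the node IDs are arbitrary, and the shift value of 3 is taken from the Recommended Settings section; the rest of the graph (prompts, sampler, VAE decode) is omitted.

```python
import json

# Hypothetical filename; substitute the checkpoint file you actually downloaded.
CHECKPOINT_NAME = "snakebite2.safetensors"


def build_model_chain(ckpt_name: str, shift: float = 3.0) -> dict:
    """Return a graph fragment: checkpoint loader -> ModelSamplingSD3."""
    return {
        "1": {
            "class_type": "CheckpointLoaderSimple",
            "inputs": {"ckpt_name": ckpt_name},
        },
        "2": {
            "class_type": "ModelSamplingSD3",
            # ["1", 0] means "output slot 0 (the MODEL) of node 1".
            "inputs": {"model": ["1", 0], "shift": shift},
        },
    }


graph = build_model_chain(CHECKPOINT_NAME)
print(json.dumps(graph, indent=2))
```

The sampler node would then take its model input from node "2" rather than directly from the loader.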


Why change the formula?

While I'm happy with the original Snakebite, there are some "gaps" between the two architectures I haven't been able to close with merging. Over the course of 1.0 through 1.4, I did what I could to minimize weird background objects and extra limbs, but it occurred to me that the perfect solution is already right here, in the form of vanilla bigASP 2.5.

I don't know if many people realize how good bigASP is... the prompt adherence is almost Flux-level with none of the censorship, plastic skin, steep hardware requirements, or bad licensing. It's pretty remarkable.

I set out to solve two of its main problems:

1. bigASP's textures are straight-up scuffed. I don't know if there was an issue with its aesthetic captioning or if it's simply "seen too much" (it was trained on 13 million images!), but no amount of (((high quality, masterpiece, so good))) is going to produce an image that looks even half as good as that of your average SDXL model.

2. You need to prompt it for everything. This is not necessarily a bad thing. Problem is, bigASP has some very weird ideas about the stuff you fail to mention. For example, if you ask for "1girl, standing", it might give you a picture of "1girl, standing, morbidly obese, upside down".

Both of these problems have been addressed, at least to an extent. It wasn't easy! bigASP's input blocks are really delicate - if you try massaging them with aesthetic LoRAs, the model tends to fall apart completely. Compatibility with SDXL LoRAs is poor, because they were not trained with the Flow Matching technique.

Still, I found some blocks that responded well to my cosmetic upgrades. So I have been slowly and carefully introducing these blocks to things like Direct Preference Optimization, with the goal of helping bigASP know what to do when you don't provide a 500-word prompt (i.e. make every picture look decent and not insane).


Advantages over v1

1. Prompt adherence is UH-MAZING for an SDXL model - check the demo gallery

2. Understands more complex concepts and interactions

3. Mangled limbs are almost nonexistent thanks to Flow Matching

4. Very flexible with styles; more photorealistic than v1 while also more capable of generating illustrations

5. It can spell words pretty well, provided you don't mind re-rolling a few times

Disadvantages

1. Aesthetically, it's not as consistent as IL - but it's way better than vanilla bigASP

2. The lack of IL means booru tag knowledge is worse, but you might be surprised at how much bigASP knows... it can generate tons of mainstream characters and concepts just fine on its own

3. Backgrounds are a bit peculiar sometimes; I'm not sure yet whether this is bigASP's fault or something I merged in


Recommended Settings

Turbo:

  • 8 or 9 steps

  • LCM sampler

  • Beta scheduler (simple is okay too)

  • CFG 1

  • Model shift of 3 (this is the value that bigASP was trained on)
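
For intuition about what the model shift does, here is a small sketch that collects the settings above into a dict and shows how a shift remaps noise levels. The remapping formula is my understanding of how flow-matching samplers commonly apply a shift (sigma in [0, 1] becomes shift * sigma / (1 + (shift - 1) * sigma)); treat it as an assumption, not something stated by the model author. The dict keys are illustrative names, not a real config schema.

```python
# Settings from the list above; key names are illustrative only.
TURBO_SETTINGS = {
    "steps": 8,          # 8 or 9
    "sampler": "lcm",
    "scheduler": "beta",  # "simple" is okay too
    "cfg": 1.0,
    "shift": 3.0,
}


def shift_sigma(sigma: float, shift: float) -> float:
    """Remap a noise level in [0, 1]; shift > 1 pushes values toward 1 (more noise)."""
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


# Compare unshifted and shifted noise levels at a few points.
for sigma in (0.0, 0.25, 0.5, 0.75, 1.0):
    shifted = shift_sigma(sigma, TURBO_SETTINGS["shift"])
    print(f"sigma={sigma:.2f} -> shifted={shifted:.3f}")
```

Under this assumed formula, the endpoints 0 and 1 stay fixed and a midpoint of 0.5 moves to 0.75, so with a shift of 3 more of the step budget is spent at high noise levels.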


Prompting Guide

The #1 thing is, be careful with your fluff. If you ask for "warm lighting", you better believe you're gonna get warm lighting. Like, a lot of it. Even adding a simple "high quality" to your prompt might change your image completely. So be deliberate. Start with zero fluff.

The effect is not always intuitive. For example--as the author of bigASP has pointed out--the term masterpiece quality "causes the model to tend toward producing illustrations/drawings instead of photo."

If it's photos you want, I've yet to find phrases that work better than onlyfans, abbywinters photo. Hey now, I'm being serious! These terms work great for innocent stuff, too.

Also, bigASP's training data was captioned with JoyCaption (online demo here, made by the same author as bigASP), so you should try speaking to the model in a similar cadence and tone as JoyCaption does.

Most of the time, if Snakebite is not giving you the image you want, it's a matter of finding another phrasing.
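
One way to put the "start with zero fluff" advice into practice is to generate the bare prompt first, then add one extra term at a time and compare results on a fixed seed. The helper below is a hypothetical sketch for building those prompt variants; the function name and the example prompt are made up for illustration.

```python
def fluff_variants(base: str, fluff_terms: list[str]) -> list[str]:
    """Return the bare prompt, followed by one variant per added fluff term."""
    return [base] + [f"{base}, {term}" for term in fluff_terms]


# Example prompt and terms are illustrative; swap in your own.
variants = fluff_variants(
    "photo of a woman standing in a kitchen",
    ["warm lighting", "high quality"],
)
for prompt in variants:
    print(prompt)
```

Rendering each variant with the same seed makes it easier to see which single term changed the image.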


Training LoRAs

Use the official LoRA training script for bigASP 2.5 here:

It's easy to install. I'm running it through my kohya-ss venv, as it only required a couple of extra (non-conflicting) dependencies.

As far as I know, kohya does not support Flow Matching for SDXL training, hence the need for this script.

Default settings are good. You can increase lora_rank and lora_alpha if you want, but the default value of 32 is usually fine. The script buckets images. Be aware that it only saves a checkpoint at the end of training. I modified the script to save every N steps instead (save_every_n_steps). Let me know if you want my modified copy.

Don't train on turbo versions of Snakebite. Either use the full version (once I've uploaded it), or train on bigASP 2.5 vanilla.


Thank you. As always, I look forward to your feedback. Please share the model and upload some images to help it gain traction. It would be amazing if we could make Snakebite eligible for Civitai's online generator someday!
