Wahtastic Mix

Model description

The goal of this model is to provide an experience very close to the already fairly competent NoobAI v-pred base while smoothing out its rough edges. Many other merges are bimodal: they offer either good prompt adherence (staying closer to base NoobAI) or good default aesthetics (staying closer to Illustrious).

Ideally, a single model can deliver both without sacrificing too much model knowledge.

ETH Wallet Address for Donations: 0x645BebF82373865eC520d8AC2527524BfB174FF8

Wahtastic Roadmap

  • ✅ 1536x Super-resolution support

    • Allow for 1536x native generation (and slightly above), akin to Illustrious 2.0+
  • Fix e6 size tag implications (hyper ≠ huge ≠ big)

    • In short, e6 tags carry implications: hyper_* implies huge_*, and huge_* implies big_*

    • Because of this, the model tends to associate big with huge and huge with hyper, so big_* sometimes produces disproportionately large body parts.

  • Natural language captioning

    • Yes, CLIP sucks.

    • Using lodestone-rock's natural-language captions, ideally some amount of natural language understanding can be brought back

    • This is inspired by EasyFluff /XL

  • Superior style knowledge

    • ~20k e6 artists with > 500 < 20 posts

    • ~24k danbooru artists
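The size-tag implication issue from the roadmap can be illustrated with a small sketch. This is only an assumption about how such a fix might look at the captioning stage, not the author's actual pipeline: `SIZE_CHAIN`, `expand_implications`, and `strip_implied` are hypothetical names, and the example tags are placeholders. The first function mimics what the implications do to a post's tag list; the second keeps only the most specific size tag per subject so that big_*, huge_*, and hyper_* stop co-occurring in training captions.

```python
# Hypothetical ordering, least to most extreme, mirroring the
# hyper_* -> huge_* -> big_* implication chain described in the roadmap.
SIZE_CHAIN = ["big", "huge", "hyper"]


def expand_implications(tags):
    """Add every weaker size tag implied by a stronger one."""
    out = set(tags)
    for tag in tags:
        prefix, _, rest = tag.partition("_")
        if prefix in SIZE_CHAIN and rest:
            # Every entry before `prefix` in the chain is implied by it.
            for weaker in SIZE_CHAIN[:SIZE_CHAIN.index(prefix)]:
                out.add(f"{weaker}_{rest}")
    return out


def strip_implied(tags):
    """Drop size tags that are implied by a stronger tag on the same subject."""
    tags = set(tags)
    out = set(tags)
    for tag in tags:
        prefix, _, rest = tag.partition("_")
        if prefix in SIZE_CHAIN and rest:
            for weaker in SIZE_CHAIN[:SIZE_CHAIN.index(prefix)]:
                out.discard(f"{weaker}_{rest}")
    return out
```

With this sketch, a post tagged only `hyper_tail` expands to all three size tags, and stripping the expanded set leaves `hyper_tail` alone, so each size word would be seen only with images that match it.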

Recommended Settings

For optimal results, we recommend the following inference parameters:

  • Sampler: Euler or Euler A

  • Scheduler: Normal or Beta

  • Steps: 16-24

  • CFG Scale: 3-6

  • Resolution:

    • For general use: 832x1200 (or similar aspect ratios with a total area around 1024x1024)

    • For V9.1 and above: Can natively handle 1536x resolutions.
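The "total area around 1024x1024" guideline can be turned into a small helper for picking dimensions at other aspect ratios. This is a sketch under assumptions: `resolution_for` is a hypothetical name, and snapping to multiples of 64 is a convention borrowed from common SDXL bucket sizes, not something the model requires (the 832x1200 example above does not follow it).

```python
import math


def resolution_for(aspect_ratio, target_area=1024 * 1024, multiple=64):
    """Return (width, height) with width/height close to aspect_ratio
    and width*height close to target_area, snapped to `multiple`."""
    height = math.sqrt(target_area / aspect_ratio)
    width = height * aspect_ratio

    def snap(value):
        # Round to the nearest multiple, never going below one multiple.
        return max(multiple, round(value / multiple) * multiple)

    return snap(width), snap(height)
```

For example, `resolution_for(1.0)` returns `(1024, 1024)`. For V9.1 and above, passing `target_area=1536 * 1536` gives sizes in the 1536x range.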
