Lab @1 : trying to have better pantyhose, see-through, maturity.

Details

Download Files

Model description

For all the LoRa Lab tests' info. (如果像了解更多轮次的尝试)

please go to /model/1366934

请移步 /model/1366934

Explanation different approach of current Test Suite (本轮训练实验集中, 不同方案的说明)

Try the following approaches to see how the trained "nsfw stylized (pantyhose & see-through)" LoRa feels and what the differences are:

This time captions are directly derived using WD14 Conv v2 tagger, without any additions or deletions

尝试一下如下的方案, 所训练出来的 "nsfw 风格化 (pantyhose & see-through)" 的 LoRa 感觉如何, 各自有何差别 :

本次 captions 尝试直接用 WD14 Conv v2 tagger 推导, 无任何的增删

Desired Training Effects (训练想要的效果)

  • Increase the maturity of girls

  • Add natural transparency effect to clothing

  • Adjust the style of pantyhose (flesh-colored shiny style; and increase the default probability of pantyhose when not specified)

  • 提升女孩的成熟度

  • 加入衣物的原生透明效果

  • 调整连裤袜的风格 (肉色舍宾风格; 且增加无指定时裤袜的默认几率)

Some Findings (一些发现)

Lab@1 - 250316 Record (Lab@1 - 250316 记录)

  • Some input images (dataset) and regularization images mentioned here can be referenced in the model's image showcase (showcase)

    • One great thing about civit is that it displays image-related prompts and parameters very well, including XY comparison charts generated by webui
  • p1 wor can actually pixel-level replicate the input girl's appearance in many scenes

    • But the disadvantage is also very obvious: without special prompts, it tries very hard to restore all features, with little variability
  • full body in WAI's base model has a high coupling with standing posture

    • But trying to add standing in neg prompt doesn't seem to have obvious effect
  • The feeling of p24 training is that it does migrate girl's features to other characters, as seen in the test image of Ao Run (敖闰) where the dragon's face shows obvious feminization influence

  • p24 wor version, when +pantyhose is used, sometimes seems to produce rather peculiar poses (not sure if it's because most poses in the dataset are quite large)

    • For details not learned (like hands, small objects, some detail logic, ...), p24 doesn't seem to handle them very well
  • p24 wr feels visually the best in quality

    • Although many features are not directly reflected, but rather have their own understanding and fusion

    • But its control ability over +full body seems problematic

      • Not sure if it's related to reg images containing many partial body images
    • Also, when pantyhose appears without other prompt controls, it tends to output lower body images

  • Additionally, when dealing with complex compositions, p24 series sometimes tends to express more exaggeratedly, leading to image collapse

  • 这里说的一些输入图片 (dataset), 还有正则化图片, 可以参考本个模型里面的图片演示 (showcase)

    • civit 非常不错的一个点是, 它会把图片相关的提示词和参数很好的展示出来, 包括 webui 产出的 XY 对比图
  • p1 wor 居然能在不少的场景下, 像素级别地复刻输入的女孩形象

    • 但劣势也非常明显, 在无特殊的提示词时, 它会非常努力地还原所有特征, 可变性不强
  • full body 在 WAI 这个底模 (base model) 里, 和 standing 的姿态有很高的耦合性

    • 而且, 尝试过在 neg prompt 里加上 standing 好像效果也不明显
  • p24 训练下的感觉, 确实会将女孩的特征迁移到其他形象上, 如测试中的敖闰 (ao run) 图片中的 "龙" 的脸部有很明显的女性化影响

  • p24 wor 版本, 在 +pantyhose 时, 有时好像容易出现比较奇特的姿势 (不知道是否和数据集大多数姿势都比较大的原因)

    • 没有学习过的细节处理 (如手部, 一些小物件, 一些细节逻辑, ...), p24 好像处理起来不是太好
  • p24 wr 的质量在直观上感觉最优

    • 虽然很多特征并没有很直接的体现, 而是有它自己的理解和融合

    • 但是它对 +full body 的控制能力好像有问题

      • 不知道是否和 reg 图片包含很多局部身体图片有关
    • 并且, 一旦出现 pantyhose 且没有其他提示词控制时, 它倾向于输出 lower body 的图片

  • 另外, p24 系列处理复杂构图时, 有时会倾向于更夸张地去表达, 导致崩图

Potential Issues (可能的问题)

  • The entire dataset is 1girl, solo, so the default style for other characters may have problems (such as male characters)

  • Not sure about LoRa's ability to handle complex prompts

  • 全数据集都是 1girl, solo, 所以其他形象的默认风格可能会出现问题 (如男人形象)

  • 不太确定 LoRa 在处理复杂 prompt 的能力

Current Training Settings (本次训练相关设置)

Number of images: number (refer to p1, p3, p8, p24)
Regularization: with reg (wr) | without reg (wor)
图片数量: number (参考 p1, p3, p8, p24)
是否正则化: with reg (wr) | without reg (wor)

Model Training Parameters (模型训练参数)

[Training Parameters]
BASE=https://civitai.com/models/827184/wai-nsfw-illustrious-sdxl => v12
EPOCHS=8                                # Number of training epochs
SEED=1861

[Resolution Settings]
TRAINING_WIDTH=1024                     # Training width
TRAINING_HEIGHT=1152                    # Training height
MAX_BUCKET_RESO=$TRAINING_HEIGHT        # Maximum bucket resolution

[Hardware Adaptation]
DIM=32                                  # Network dimension
DIM_ALPHA=28                            # Network alpha value
TRAINING_BATCH_SIZE=1                   # Batch size

[Training Parameters]
--train_batch_size 1
--learning_rate 0.0001
--unet_lr 0.0001
--text_encoder_lr 0.00001
--optimizer_type "AdamW8bit"
--mixed_precision "bf16"
--save_model_as "safetensors"
--save_precision "fp16"
--save_every_n_epochs 2
--cache_latents to_disk
--xformers
--lowram
[训练参数]
BASE=https://civitai.com/models/827184/wai-nsfw-illustrious-sdxl => v12
EPOCHS=8                                # 训练轮数
SEED=1861

[分辨率设置]
TRAINING_WIDTH=1024                     # 训练宽度
TRAINING_HEIGHT=1152                    # 训练高度
MAX_BUCKET_RESO=$TRAINING_HEIGHT        # 最大分桶分辨率

[硬件适配]
DIM=32                                  # 网络维度 
DIM_ALPHA=28                            # 网络alpha值 
TRAINING_BATCH_SIZE=1                   # 批次大小

[训练参数]
--train_batch_size 1
--learning_rate 0.0001
--unet_lr 0.0001
--text_encoder_lr 0.00001
--optimizer_type "AdamW8bit"
--mixed_precision "bf16"
--save_model_as "safetensors"
--save_precision "fp16"
--save_every_n_epochs 2
--cache_latents to_disk
--xformers
--lowram

Images made by this model

No Images Found.