Here's a human-readable version of the provided configuration, organized into clear sections with explanations, followed by the sample prompts:

Dataset Configuration

Subsets:
- Number of Repeats: 4
  (Each image in the dataset will be repeated 4 times during training to increase its influence.)
- Image Directory: /workspace/training/7dd6c905-0edb-4cd6-bb4c-39fa6c179726/img
  (The folder containing the training images.)

General Settings

Resolution: 1024
(Images will be processed at a resolution of 1024x1024 pixels.)
Shuffle Caption: Enabled
(Randomly shuffles words in captions to improve generalization, while respecting the keep_tokens setting.)
Keep Tokens: 3
(Preserves the first 3 tokens of each caption in their original order during shuffling.)
Flip Augmentation: Enabled
(Applies horizontal flipping to images during training to increase dataset variety.)
Caption Extension: .txt
(Captions for images are stored in text files with a .txt extension.)
Enable Bucket: Enabled
(Groups images into buckets based on their resolution for efficient training.)
Bucket Resolution Steps: 64
(Buckets are created in increments of 64 pixels to match image resolutions.)
Bucket No Upscale: Enabled
(Prevents upscaling of images to fit bucket resolutions, preserving original sizes.)
Minimum Bucket Resolution: 256
(The smallest resolution bucket is 256x256 pixels.)
Maximum Bucket Resolution: 2048
(The largest resolution bucket is 2048x2048 pixels.)

Sample Prompts

These are the prompts used to generate sample outputs during training (likely from the file referenced in the previous configuration). They describe the concepts the model is being trained to generate:

Prompt: "father, rugby coach"
(Generates images of a father who is a rugby coach.)
Prompt: "grandfather, rugby coach"
(Generates images of a grandfather who is a rugby coach.)
Prompt: "father and grandfather went to mosque together"
(Generates images of a father and grandfather together at a mosque.)

Here's a human-readable version of the configuration, organized into clear sections with explanations:

Network Configuration

U-Net Learning Rate: 0.0005
(Controls how fast the U-Net model learns during training.)
Text Encoder Learning Rate: 0.00005
(Sets the learning rate for the text encoder, which processes text prompts.)
Network Dimension: 32
(Defines the size of the LoRA network layers.)
Network Alpha: 16
(Controls the scaling factor for LoRA weights.)
Network Module: LoRA (networks.lora)
(Uses LoRA, a lightweight fine-tuning method for efficient training.)

Optimizer Settings

Learning Rate: 0.0005
(The base learning rate for the optimizer.)
Learning Rate Scheduler: Cosine with Restarts
(Gradually reduces the learning rate following a cosine curve, with 3 cycles of restarts.)
Learning Rate Warmup Steps: 0
(No warmup period for the learning rate.)
Optimizer Type: Adafactor
(An adaptive optimizer designed for memory efficiency.)
Optimizer Arguments:
- Scale Parameter: Disabled
- Relative Step: Disabled
- Warmup Init: Disabled
  (Custom settings to tweak how Adafactor behaves.)

Training Settings

Maximum Training Steps: 0
(No fixed step limit; training is controlled by epochs instead.)
Maximum Training Epochs: 33
(The model will train for 33 passes over the dataset.)
Save Model Every N Epochs: 1
(Saves the model after every epoch.)
Sample Generation Every N Epochs: 1
(Generates sample outputs after every epoch.)
Sample Prompts File: Located at /workspace/training/7dd6c905-0edb-4cd6-bb4c-39fa6c179726/text/sample_prompts.txt
(Uses prompts from this file to generate samples during training.)
Sample Sampler: Euler_a
(Uses the Euler Ancestral sampling method for generating images.)
Training Batch Size: 4
(Processes 4 images per batch during training.)
Noise Offset: 0.1
(Adds slight noise to training data to improve stability.)
Clip Skip: 1
(Skips the last layer of the CLIP model for text encoding.)
Weighted Captions: Disabled
(Treats all captions equally, without assigning weights.)
Maximum Token Length: 225
(Allows up to 225 tokens for text prompts.)
Low RAM Mode: Disabled
(Uses full RAM capacity for faster training.)
Data Loader Workers: 8
(Uses 8 parallel workers to load data, speeding up training.)
Persistent Data Loader Workers: Enabled
(Keeps data loader workers active between batches for efficiency.)
Save Precision: Bfloat16 (bf16)
(Saves the model in bfloat16 format for reduced memory usage.)
Mixed Precision Training: Bfloat16 (bf16)
(Uses bfloat16 for calculations to balance speed and precision.)
Output Directory: /workspace/training/7dd6c905-0edb-4cd6-bb4c-39fa6c179726/model
(Where trained models are saved.)
Logging Directory: /workspace/training/7dd6c905-0edb-4cd6-bb4c-39fa6c179726/logs
(Where training logs are stored.)
Output Model Name: manly20250529--01khmer001
(The name of the saved model.)
Save Training State: Disabled
(Does not save the full training state, only the model weights.)
Xformers: Enabled
(Uses Xformers for optimized attention mechanisms, improving speed.)
SDPA (Scaled Dot-Product Attention): Enabled
(Enables efficient attention computation for better performance.)
No Half VAE: Enabled
(Disables half-precision for the Variational Autoencoder to maintain quality.)
Gradient Checkpointing: Enabled
(Reduces memory usage by recomputing gradients during backpropagation.)
Gradient Accumulation Steps: 1
(Processes gradients in a single step, no accumulation.)

Advanced Training Settings

Multi-Resolution Noise Iterations: 6
(Applies noise at multiple resolutions for 6 iterations to improve image quality.)
Multi-Resolution Noise Discount: 0.3
(Reduces noise impact by 30% across iterations.)
Minimum SNR Gamma: 5.0
(Enforces a minimum signal-to-noise ratio to stabilize training.)

Model Settings

Pretrained Model Path: /model_cache/@civitai/889818/889818.safetensors
(Uses this pre-trained model as the starting point.)
V2 Model: Disabled
(Not using a V2 model architecture.)

Saving Settings

Save Model Format: Safetensors
(Saves the model in the efficient Safetensors format.)

DreamBooth Settings

Prior Loss Weight: 1.0
(Balances the influence of prior preservation loss in DreamBooth training.)

Dataset Settings

Cache Latents: Enabled
(Precomputes and caches latent representations of images to speed up training.)

This configuration is tailored for fine-tuning a model (likely a Stable Diffusion model) using LoRA and DreamBooth techniques, with a focus on efficiency and quality. It uses bfloat16 precision, advanced optimization techniques, and noise management to produce a high-quality model named manly20250529--01khmer001.

Model Type	LORA
Base Model	Illustrious
Published	2025-06-04
Trained Words	chweeee1

manly20250529--01khmer001

Details

Download Files (1)

About this version

Model description

Images made by this model