SECourses 3D Render for FLUX - Full Dataset and Workflow Shared
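Model Description
Full FLUX style training tutorial, guide, and research
Hugging Face repository with the full workflow and all research details, processes, conclusions, checkpoints, comparisons, and prompts > https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX
Trigger words: ohwx 3d render
The last image is the training dataset grid.
This is the training of a public LoRA style: 4 separate training runs, each on 4x A6000 GPUs.
The experiments compare training with captions against training without captions, to see which works better for style training on FLUX.
Captions were generated with my own multi-GPU batch JoyCaption app, run on 8x A6000 GPUs for very fast captioning: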
https://www.patreon.com/posts/110613301

I then used my Gradio batch caption editor to change some words and add the activation token "ohwx 3d render":
https://www.patreon.com/posts/108992085

The no-captions dataset uses only "ohwx 3d render" as the caption of every image.
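For the no-captions run, every image gets the same one-line caption. The sketch below shows one way such caption files could be produced. It assumes the trainer reads a sidecar text file that shares the image's base name and uses a `.txt` extension; that convention, the function name `write_trigger_captions`, and the extension list are assumptions for illustration, not the author's tooling.

```python
from pathlib import Path

TRIGGER = "ohwx 3d render"  # activation token from this model card
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp"}  # assumed image types


def write_trigger_captions(image_dir, caption=TRIGGER, caption_ext=".txt"):
    """Write one sidecar caption file per image, containing only `caption`.

    Returns the list of caption paths that were written.
    """
    written = []
    # sorted() materializes the listing before any new files are created
    for image_path in sorted(Path(image_dir).iterdir()):
        if image_path.suffix.lower() not in IMAGE_EXTENSIONS:
            continue  # skip non-image files such as notes or configs
        caption_path = image_path.with_suffix(caption_ext)
        caption_path.write_text(caption, encoding="utf-8")
        written.append(caption_path)
    return written

# Example call (hypothetical folder name):
# write_trigger_captions("my_style_dataset")
```

Check which caption extension your trainer expects before relying on `.txt`.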
I used my latest 4x_GPU_Rank_1_SLOW_Better_Quality.json config on 4x A6000 GPUs and trained for 500 epochs on the initial 114-image dataset:
https://www.patreon.com/posts/110879657

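All runs were saved in float format with a LoRA network rank of 128, which is why each checkpoint is larger than 2 GB.
Inconsistent Dataset Training
This was my first training run, using the following dataset: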
Inconsistent-Training-Dataset-Images-Grid.jpg
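If you look closely at the grid image shared above, you will see that the dataset is not consistent.
The captioned training dataset (used only for the with-captions run) is available in the repository; it contains 114 images in total.
Total steps for this run: 500 * 114 / 4 (4 GPUs, batch size 1) = 14250 steps.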
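The step count above is plain arithmetic: epochs times images, divided by the number of images consumed per step. A small sketch follows; the function name `total_steps` is hypothetical, and it assumes no gradient accumulation and rounds a fractional result up, which may differ from how a particular trainer rounds per epoch.

```python
def total_steps(epochs, num_images, num_gpus, batch_size_per_gpu=1):
    """Steps needed when every step consumes num_gpus * batch_size_per_gpu images.

    Assumes no gradient accumulation; a fractional result is rounded up.
    """
    images_per_step = num_gpus * batch_size_per_gpu
    total_images_seen = epochs * num_images
    # Integer ceiling division avoids floating-point rounding surprises
    return (total_images_seen + images_per_step - 1) // images_per_step


print(total_steps(epochs=500, num_images=114, num_gpus=4))  # 14250
```

The same function gives the step number of any intermediate checkpoint once you pass the epoch at which it was saved.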
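On 4x RTX A6000 GPUs with the slow config, training took about 37 hours; the fast config takes roughly half as long.
Two training runs were made with this dataset. Their epoch-500 checkpoints are named as follows: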
SECourses_Style_Inconsistent_DATASET_NO_Captions.safetensors SECourses_Style_Inconsistent_DATASET_With_Captions.safetensors
Their checkpoints are saved in the following folders:
Training-Checkpoints-NO-Captions Training-Checkpoints-With-Captions
The results grid is shown below:
Inconsistent-Training-Dataset-Results-Grid-26100x23700px.jpg
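If you look closely at the image above, you will see that the outputs are inconsistent.
Consistent Dataset Training
After noticing that the initial training dataset was inconsistent, I pruned it to make it more consistent.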
Fixed-Consistent-Training-Dataset-Images-Grid.jpg
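If you look closely at the grid image shared above, you will see that it is much more consistent, though still not perfect.
The dataset now contains 66 images.
The captioned dataset used for this training (with-captions run only) can be viewed in this directory: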
Fixed-Consistent-Training-Dataset
Total steps for this run: 500 * 66 / 4 (4 GPUs, batch size 1) = 8250 steps.
On 4x RTX A6000 GPUs with the slow config, training took about 24 hours; the fast config takes roughly half as long.
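As a rough back-of-the-envelope check, the reported durations and step counts imply an average time per step for the slow config. The sketch below only restates the figures given in this card; the durations are approximate, so the results are estimates, and the helper name is hypothetical.

```python
def seconds_per_step(hours, steps):
    """Average wall-clock seconds per training step."""
    return hours * 3600 / steps


# (approximate hours, total steps) as reported for the two slow-config runs
runs = {
    "inconsistent dataset, 114 images": (37, 14250),
    "consistent dataset, 66 images": (24, 8250),
}

for name, (hours, steps) in runs.items():
    print(f"{name}: about {seconds_per_step(hours, steps):.1f} s/step")
```

Both runs land in the same range of roughly 9 to 10.5 seconds per step, which is what you would expect when the hardware and config are unchanged and only the dataset size differs.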
Two training runs were made with this dataset. Their epoch-500 checkpoints are named as follows:
SECourses_3D_Render_Style_Fixed_Dataset_NO_Captions.safetensors SECourses_3D_Render_Style_Fixed_Dataset_With_Captions.safetensors
Their checkpoints are saved in the following folders:
Training-Checkpoints-Fixed-DATASET-NO-Captions Training-Checkpoints-Fixed-DATASET-With-Captions
The results grid is shown below; it also includes the results from the inconsistent dataset for comparison:
Fixed-Consistent-Training-Dataset-Results-Grid-50700x15500px.jpg
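If you look closely at the image above, you will see that the results are now much more consistent.
Best Checkpoint and Conclusion
With the inconsistent dataset, training with captions gave far better results than training without captions.
With the consistent dataset, however, training without captions gave better and more consistent results from the early epochs onward.
I therefore concluded that epoch 75 of the no-captions run on the consistent dataset is the best checkpoint.
The comparison images for the consistent dataset are: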
Fixed-Consistent-Training-Dataset-No-Captions-Only-Grid.jpg
Fixed-Consistent-Training-Dataset-With-Captions-Only-Grid.jpg
75 epochs corresponds to 75 * 66 / 4 = 1237.5, that is, about 1238 steps.
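Tutorials for Training Your Own Style
1: https://youtu.be/bupRePUOA18
FLUX: The First Ever Open Source txt2img Model Truly Beats Midjourney & Others - FLUX is Awaited SD3
2: https://youtu.be/nySGu12Y05k
FLUX LoRA Training Simplified: From Zero to Hero with Kohya SS GUI (8GB GPU, Windows) Tutorial Guide
3: https://youtu.be/-uhL2nW7Ddw
Blazing Fast & Ultra Cheap FLUX LoRA Training on Massed Compute & RunPod Tutorial - No GPU Required!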
This dataset may not be used for commercial purposes.

Prompts Used for Grid Testing (example images were taken from the grid without cherry-picking)
a ohwx 3d rendering of a car
a car rendered in ohwx 3d style
a ohwx style car image
a ohwx render of a car
a ohwx car
a ohwx 3d rendering of a chest, depicted in a cartoon style. The background is a plain white, making the chest and its contents stand out clearly. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give the chest a realistic, three-dimensional appearance. The metal bands and rivets add a sense of realism and durability to the chest. The image is vibrant and eye-catching, inviting the viewer to imagine the treasure within. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold, with a focus on oranges, browns, and golds to create a sense of warmth and excitement. The overall mood is one of excitement and discovery.
a ohwx 3d rendering of an airplane, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a battleship, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a robot, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a dog, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a cat, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of an axe, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a house, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
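a ohwx 3d rendering of a dragon, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a flower, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a rose, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a tank, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a computer, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a GPU, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a fork, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of a lock, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.
a ohwx 3d rendering of an umbrella, depicted in a cartoon style. The background is a plain white. The overall style is playful and whimsical, with clean lines and bright colors, suggesting a fantasy or adventure theme. The illustration is highly detailed, with a focus on textures and shading to give a realistic, three-dimensional appearance. The image is vibrant and eye-catching. The illustration is likely used in a digital context, such as a game or a children's book. The colors are bright and bold to create a sense of warmth and excitement.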
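
Most of the long prompts above differ only in the subject noun, so they can be generated from one template. The sketch below is one way to do that; the helper names and the naive vowel-based article rule are assumptions for illustration, and the chest prompt is not covered because it carries extra subject-specific sentences.

```python
LONG_TEMPLATE = (
    "a ohwx 3d rendering of {subject}, depicted in a cartoon style. "
    "The background is a plain white. "
    "The overall style is playful and whimsical, with clean lines and bright "
    "colors, suggesting a fantasy or adventure theme. "
    "The illustration is highly detailed, with a focus on textures and shading "
    "to give a realistic, three-dimensional appearance. "
    "The image is vibrant and eye-catching. "
    "The illustration is likely used in a digital context, such as a game or a "
    "children's book. "
    "The colors are bright and bold to create a sense of warmth and excitement."
)


def with_article(noun):
    """Prefix 'a' or 'an' using a naive first-letter vowel check."""
    article = "an" if noun[0].lower() in "aeiou" else "a"
    return f"{article} {noun}"


def build_long_prompt(noun):
    """Fill the shared template with one subject noun."""
    return LONG_TEMPLATE.format(subject=with_article(noun))


subjects = ["airplane", "battleship", "robot", "dog", "cat", "axe", "house"]
prompts = [build_long_prompt(noun) for noun in subjects]
print(prompts[0])
```

Keeping everything except the subject fixed makes the grid a fair comparison across checkpoints, since any change in the output comes from the checkpoint or the subject rather than from prompt wording.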


















