RMHF


Model description

I gave my simple algorithm the exaggerated name "RMHF - Reinforcement Merging on Human Feedback". It generates a new merging recipe near the current one and lets the user choose which of the two is better, in order to "learn" the best weight-merging ratios.

https://github.com/TkskKurumi/DiffusersFastAPI/blob/main/rmhf_v2.py
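The loop described above can be illustrated with a minimal sketch. This is not the code from the linked rmhf_v2.py; it only assumes that a recipe is a list of merge ratios (one per source model, summing to 1) and that the user's choice is a callback returning True when the new candidate looks better. The names `propose_neighbor`, `preference_search`, and `simulated_user` are hypothetical, and the simulated user stands in for a person comparing generated images.

```python
import random
from typing import Callable, List

# Assumption: a recipe is one merge ratio per source model, summing to 1.
Recipe = List[float]


def normalize(recipe: Recipe) -> Recipe:
    """Rescale the ratios so that they sum to 1."""
    total = sum(recipe)
    return [r / total for r in recipe]


def propose_neighbor(recipe: Recipe, step: float, rng: random.Random) -> Recipe:
    """Generate a new recipe near the current one by a small random perturbation."""
    perturbed = [max(1e-6, r + rng.uniform(-step, step)) for r in recipe]
    return normalize(perturbed)


def preference_search(
    start: Recipe,
    prefers_candidate: Callable[[Recipe, Recipe], bool],
    rounds: int = 20,
    step: float = 0.1,
    seed: int = 0,
) -> Recipe:
    """Keep whichever of (current, candidate) the judge prefers, round after round."""
    rng = random.Random(seed)
    current = normalize(start)
    for _ in range(rounds):
        candidate = propose_neighbor(current, step, rng)
        # In real use, this is where images from both recipes would be shown to the user.
        if prefers_candidate(current, candidate):
            current = candidate
    return current


def simulated_user(target: Recipe) -> Callable[[Recipe, Recipe], bool]:
    """Hypothetical stand-in for a human: prefers recipes closer to a hidden target."""
    def distance(recipe: Recipe) -> float:
        return sum((a - b) ** 2 for a, b in zip(recipe, target))

    return lambda current, candidate: distance(candidate) < distance(current)


if __name__ == "__main__":
    best = preference_search([1.0, 1.0, 1.0], simulated_user([0.7, 0.2, 0.1]), rounds=50)
    print([round(r, 3) for r in best])
```

Because a candidate replaces the current recipe only when it is preferred, the search never moves away from what the judge likes, but it can stall in a local optimum; a larger `step` or more `rounds` trades comparison effort for coverage.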
