RMHF
세부 정보
파일 다운로드 (1)
이 버전에 대해
모델 설명
I named my simple algorithm that generate new merging recipe near current and let user choose which is better to "learn" the best weight merging ratios with a exaggerated name "RMHF - Reinforcement Merging on Human Feedback".
https://github.com/TkskKurumi/DiffusersFastAPI/blob/main/rmhf_v2.py


