post_rl_indexed_v2_lora

epiTune

689

style

dpvo_2 dpvo_1 llrrw_4 llrrw_3 llrrw_2 llrrw_1 siglip cfg self-fit-3 rapid self gt

세부 정보

파일 다운로드 (1)

모델 설명

LoRAs trained using DRaFT-like post-RL using Siglip/Dino

Don't expect sensible results on any other model than indexed v2

These are just experiments. I will often not avoid over-training and forgo stronger regularization as I am more so looking for what effects are most prominent for varying data regimes.

이 모델로 만든 이미지

정렬

모델 유형	LORA
기본 모델	Illustrious
게시일	2026-01-03