post_rl_indexed_v2_lora

Details

Model description

LoRAs trained with a DRaFT-like post-RL procedure using SigLIP/DINO.

Don't expect sensible results on any model other than indexed v2.

These are just experiments. I often won't try to avoid over-training, and I skip stronger regularization, because I'm mostly looking for which effects are most prominent across different data regimes.
