Joschek's Ballgags for Z-Image

세부 정보

파일 다운로드 (1)

모델 설명

Update v2:

retrained with 2 changed variables: lr 0.0002 and much more detailed captioning with qwen3-vl. its much better alsthough there is some overfit, but it's fine maybe?!?

the right settings/ dataset for z-image are still a mystery to me.

i now suspect it really wants detailed captions to work well.

don't know if the overfit is solvable.

but anyways: works much better now and seems really compatible with style or character lora....


v2 and v2.12 are just different epochs. as i can't for the life of me decide which ones better.



V1 notes: Still not too sure about z-image training...

results seem to be 50/50 good bad.

training notes v1: small dataset with portrait selection (90 Images), 1600steps, 0.0005, 32/32

이 모델로 만든 이미지