Produces better Minions.
Trained on a ton of video clips at 5e-5, 256px, with captions for ~800 steps.