relatively large dataset (400+)
around 8000 steps
lr 0.0002
large dim. don't know if that was necessary, but it is what it is now.
works well, i think