Refined training data with better control over viewer's perspective.
We sit opposite to the lady in the train.