It turns out: Size does matter.
Not the most reliable model, takes a bit of trial and error. It helps to prompt some reference objects for scale.