Models with tag: direct preference optimization