Yaojie's Site
Home
Projects
AIPO: Improving Training Objective for Iterative Preference Optimization