AIPO: Improving Training Objective for Iterative Preference Optimization

...

September 13, 2024 · Yaojie Shen, Xinyao Wang, Yulei Niu, Ying Zhou, Lexin Tang, Libo Zhang, Fan Chen, Longyin Wen