首页 正文

Aligning Few-Step Diffusion Models with Dense Reward Difference Learning

{{output}}
Few-step diffusion models enable efficient high-resolution image synthesis but struggle to align with specific downstream objectives due to limitations of existing reinforcement learning (RL) methods in low-step regimes with limited state spaces and suboptimal... ...