DyDiT++: Diffusion Transformers with Timestep and Spatial Dynamics for Efficient Visual Generation

Diffusion Transformer (DiT), an emerging diffusion model for visual generation, has demonstrated superior performance but suffers from substantial computational costs. Our investigations reveal that these costs primarily stem from the static inference paradig... ...