PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
In this paper, we introduce PixArt-Σ, a Diffusion Transformer (DiT) model capable of directly generating images at 4K resolution.