VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory
Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Compositional Text-to-Image Generation with Dense Blob Representations