I’m a fourth-year Ph.D. student at the Department of Computer Science and Technology, Tsinghua University, advised by Prof. Jun Zhu. Before that, I received my B.E. degree from the same department at Tsinghua University in 2022. In the summer of 2024, I was honored to have an internship at NVIDIA Deep Imagination Research in the San Francisco Bay Area.
My research focuses on developing principled, insightful, scalable, efficient and effective training/inference techniques for deep generative models, with a particular emphasis on diffusion-related models. I am also interested in reinforcement learning, unified multimodal model and world model.
Selected Publications & Preprints [full list]
(*) denotes equal contribution; (†) denotes corresponding author
-
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Technical Report, 2025
-
World Simulation with Video Foundation Models for Physical AI
NVIDIA
Technical Report, 2025
-
Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency
In The Fourteenth International Conference on Learning Representations, 2026
-
DiffusionNFT: Online Diffusion Reinforcement with Forward Process
In The Fourteenth International Conference on Learning Representations, 2026
-
Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator
In Proceedings of the 42nd International Conference on Machine Learning, 2025
-
Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling
In The Thirteenth International Conference on Learning Representations, 2025
-
Diffusion Bridge Implicit Models
In The Thirteenth International Conference on Learning Representations, 2025
-
Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models
Technical Report, 2024
-
DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics
In Advances in Neural Information Processing Systems, 2023
-
Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs
In Proceedings of the 40th International Conference on Machine Learning, 2023
My slides for TSAIL reading group:
I enjoy playing ping-pong🏓 in my free time. I also watch esports (especially League of Legends) and have a taste for Chinese calligraphy (a piece of the poem 'Qingming' ("清明" in Chinese)). I customized and hosted a Minecraft server at middle school ([Pic1][Pic2][Pic3][Pic4]). I am enthusiastic about math since middle school. In high school, I compiled and wrote a set of interesting math exercises to form a test ([Problems][Answers], in Chinese).