publications

(*) denotes equal contribution; (†) denotes corresponding author

2025

  1. Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator
    arXiv preprint arXiv:2503.01103, 2025
  2. Visual Generation Without Guidance
    Huayu Chen*, Kai Jiang*Kaiwen ZhengJianfei ChenHang Su, and Jun Zhu
    arXiv preprint arXiv:2501.15420, 2025
  3. Elucidating the Preconditioning in Consistency Distillation
    Kaiwen Zheng*Guande He*Jianfei ChenFan Bao, and Jun Zhu
    In The Thirteenth International Conference on Learning Representations, 2025
  4. Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling
    In The Thirteenth International Conference on Learning Representations, 2025

    Top 5%

  5. Diffusion Bridge Implicit Models
    Kaiwen Zheng*Guande He*Jianfei ChenFan Bao, and Jun Zhu
    In The Thirteenth International Conference on Learning Representations, 2025

2024

  1. Consistency Diffusion Bridge Models
    Guande He*Kaiwen Zheng*Jianfei ChenFan Bao, and Jun Zhu
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
  2. Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
    Huayu ChenKaiwen ZhengHang Su, and Jun Zhu
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
  3. Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
  4. Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models
    Fan BaoChendong Xiang*, Gang Yue*Guande He*Hongzhou Zhu*Kaiwen Zheng*Min Zhao*Shilong Liu* , Yaole Wang*, and Jun Zhu
    Technical Report, 2024
  5. InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image
    Jianhui LiShilong Liu , Zidong Liu , Yikai WangKaiwen Zheng, Jinghui Xu , Jianmin Li, and Jun Zhu
    In The Twelfth International Conference on Learning Representations, 2024

2023

  1. Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis
    Zehua Chen*Guande He*Kaiwen Zheng*Xu Tan, and Jun Zhu
    arXiv preprint arXiv:2312.03491, 2023
  2. DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics
    Kaiwen Zheng*Cheng Lu*Jianfei Chen, and Jun Zhu
    In Advances in Neural Information Processing Systems, 2023
  3. Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs
    Kaiwen Zheng*Cheng Lu*Jianfei Chen, and Jun Zhu
    In Proceedings of the 40th International Conference on Machine Learning, 2023
  4. PREIM3D: 3d Consistent Precise Image Attribute Editing From a Single Image
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

  1. Maximum Likelihood Training for Score-Based Diffusion ODEs by High Order Denoising Score Matching
    In Proceedings of the 39th International Conference on Machine Learning, 2022