Zhenghai Xue

Nanyang Technological University, Singapore. zhenghai001@e.ntu.edu.sg

prof_pic.jpg

I am a 3rd-year Ph.D. student at the College of Computing and Data Science, Nanyang Technological University, Singapre, supervised by Prof. Bo An. Previously, I obtained my B.Sc. of Artificial Intelligence from Nanjing University in 2022. In my undergraduate study, I worked with Prof. Yang Yu at LAMDA. I also interned at the MMLab of the Chinese University of Hong Kong with Prof. Bolei Zhou, Kuaishou Technology with Dr. Qingpeng Cai, and Kunlun 2050 Research with Prof. Shuicheng Yan.

I study safe, robust, and generalizable decision-making algorithms and their applications in real-world problems, such as large language models, GUI navigation, video games, autonomous driving, robotics locomotion, and recommendation systems.

News

Jan 23, 2025 One paper is accepted at WWW 2025. Two papers are accepted to ICLR 2025.
Nov 8, 2023 I will give a talk on Optimizing Long-term User Engagement in the Applied Artificial Intelligence Workshop of DAI 2023!
Oct 19, 2023 Our paper “State Regularized Policy Optimization on Data with Dynamics Shift” is accepted by NeurIPS 2023!

Selected publications

  1. Policy Optimization under Imperfect Human Interactions with Agent-Gated Shared Autonomy
    Zhenghai Xue, Bo An, and Shuicheng Yan
    In The Thirteenth International Conference on Learning Representations, 2025
  2. AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
    Zhenghai Xue, Qingpeng Cai, Tianyou Zuo, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai, and Bo An
    In Proceedings of the ACM Web Conference (Oral), 2025
  3. State Regularized Policy Optimization on Data with Dynamics Shift
    Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, and Bo An
    Advances in Neural Information Processing Systems, 2023
  4. Guarded Policy Optimization with Imperfect Online Demonstrations
    Zhenghai Xue, Zhenghao Peng, Quanyi Li, Zhihan Liu, and Bolei Zhou
    In The Eleventh International Conference on Learning Representations (Spotlight), 2023
  5. Regret minimization experience replay in off-policy reinforcement learning
    Xu-Hui Liu*, Zhenghai Xue*, Jingcheng Pang, Shengyi Jiang, Feng Xu, and Yang Yu
    Advances in Neural Information Processing Systems, 2021