Zhenghai Xue

I am a 3rd-year Ph.D. student at the College of Computing and Data Science, Nanyang Technological University, Singapre, supervised by Prof. Bo An. Previously, I obtained my B.Sc. of Artificial Intelligence from Nanjing University in 2022. In my undergraduate study, I worked with Prof. Yang Yu at LAMDA. I also interned at the MMLab of the Chinese University of Hong Kong with Prof. Bolei Zhou, Kuaishou Technology with Dr. Qingpeng Cai, and Kunlun 2050 Research with Prof. Shuicheng Yan.

I study safe, robust, and generalizable decision-making algorithms and their applications in real-world problems, such as large language models, GUI navigation, video games, autonomous driving, robotics locomotion, and recommendation systems.

News

Jan 23, 2025	One paper is accepted at WWW 2025. Two papers are accepted to ICLR 2025.
Nov 8, 2023	I will give a talk on Optimizing Long-term User Engagement in the Applied Artificial Intelligence Workshop of DAI 2023!
Oct 19, 2023	Our paper “State Regularized Policy Optimization on Data with Dynamics Shift” is accepted by NeurIPS 2023!

Selected publications

Policy Optimization under Imperfect Human Interactions with Agent-Gated Shared Autonomy

Zhenghai Xue, Bo An, and Shuicheng Yan

In The Thirteenth International Conference on Learning Representations, 2025

HTML
AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems

Zhenghai Xue, Qingpeng Cai, Tianyou Zuo, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai, and Bo An

In Proceedings of the ACM Web Conference (Oral), 2025

arXiv
State Regularized Policy Optimization on Data with Dynamics Shift

Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, and Bo An

Advances in Neural Information Processing Systems, 2023

arXiv HTML Code
Guarded Policy Optimization with Imperfect Online Demonstrations

Zhenghai Xue, Zhenghao Peng, Quanyi Li, Zhihan Liu, and Bolei Zhou

In The Eleventh International Conference on Learning Representations (Spotlight), 2023

arXiv HTML Code
Regret minimization experience replay in off-policy reinforcement learning

Xu-Hui Liu*, Zhenghai Xue*, Jingcheng Pang, Shengyi Jiang, Feng Xu, and Yang Yu

Advances in Neural Information Processing Systems, 2021

arXiv Code Video