I am a 2nd-year Ph.D. student at the School of Computer Science and Engineering, Nanyang Technological University, Singapre, supervised by Prof. Bo An. Previously, I obtained my B.Sc. of Artificial Intelligence from Nanjing University in 2022. In my undergraduate study, I worked with Prof. Yang Yu at LAMDA. I also interned at the MMLab of the Chinese University of Hong Kong with Prof. Bolei Zhou, Kuaishou Technology with Dr. Qingpeng Cai, and Kunlun 2050 Research with Prof. Shuicheng Yan.
I study safe, robust, and generalizable decision-making algorithms and their applications in real-world problems, such as video games, autonomous driving, and recommendation systems.
|Nov 8, 2023
|I will give a talk on Optimizing Long-term User Engagement in the Applied Artificial Intelligence Workshop of DAI 2023!
|Oct 19, 2023
|Our paper “State Regularized Policy Optimization on Data with Dynamics Shift” is accepted by NeurIPS 2023!
|Oct 6, 2023
|We release our new paper AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement.
- Regret minimization experience replay in off-policy reinforcement learningAdvances in Neural Information Processing Systems, 2021
- Guarded Policy Optimization with Imperfect Online DemonstrationsIn The Eleventh International Conference on Learning Representations , 2023
- State Regularized Policy Optimization on Data with Dynamics ShiftAdvances in Neural Information Processing Systems, 2023