news

Jul 4, 2025 We release SimpleTIR, an end-to-end solution for stable multi-turn tool use RL training.
May 3, 2025 One paper is accepted at ICML 2025 as Spotlight Poster!
Jan 23, 2025 One paper is accepted at WWW 2025. Two papers are accepted to ICLR 2025.
Nov 8, 2023 I will give a talk on Optimizing Long-term User Engagement in the Applied Artificial Intelligence Workshop of DAI 2023!
Oct 19, 2023 Our paper “State Regularized Policy Optimization on Data with Dynamics Shift” is accepted by NeurIPS 2023!