My name is Chao Wen. I’m a PhD student at the Max Planck Institute for Software Systems (MPI-SWS). I am advised by Dr. Adish Singla in the Machine Teaching Group.

I received my M.Sc. from the College of Computer Science and Technology at Nanjing University of Aeronautics and Astronautics, China, where I was advised by Prof. Xiaoyang Tan. I then worked as an algorithm engineer at Lazada, Alibaba Group, from June 2021 to July 2022.

My research interests include reinforcement learning (RL) and AI for programming education.

Publications

  • Chao Wen, Miao Xu, Zhilin Zhang, Zhenzhe Zheng et al. A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising. ACM International Conference on Web Search and Data Mining (WSDM 2022). [pdf, code, poster]

  • Chao Wen, Xinghu Yao, Yuhui Wang, Xiaoyang Tan. SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning. AAAI Conference on Artificial Intelligence (AAAI 2020). [pdf, code, video]

  • Xinghu Yao, Chao Wen, Yuhui Wang, Xiaoyang Tan. SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning. IEEE Transactions on Neural Networks and Learning Systems (TNNLS 2021). [pdf]

  • Yuhui Wang, Hao He, Chao Wen, Xiaoyang Tan. Truly Proximal Policy Optimization. arXiv:1903.07940. [pdf, code]

Experience

  • 2021.06 - 2022.07: Algorithm Engineer at Lazada, Alibaba Group.
  • 2020.07 - 2021.01: Research Intern at Alimama, Alibaba Group.

Selected Honors

  • China National Scholarship for Graduate Students, 2020
  • China National Scholarship for Undergraduate Students, 2016