Hao Sun

I am a Researcher at Google DeepMind. I did my PhD at the University of Cambridge, supervised by Prof. Mihaela van der Schaar. I obtained my M.Phil. at MMLab@CUHK, advised by Prof. Dahua Lin and Prof. Bolei Zhou. I studied Physics in my undergrad at Peking University, and my thesis was supervised by Prof. Zhouchen Lin.

News

🇦🇹 (2025.07) Attended ACL 2025 Run the ACL 2025 Tutorial (T1): Inverse RL Meets LLM Alignment at Vienna. Slide. We have a write-up for interested readers to find references.

🇺🇸 (2025.07) Invited talk on RL in the Era of LLMs at Intuit AI research.

🇬🇧 (2025.06) Invited talk on RL in the Era of LLMs at Jump Trading London.

📄 (2025.05) Multi-Objective and Personalized Alignment with PCA is accepted by ACL findings! Two Papers on Prompt Optimization are accepted by ACL as main and findings.

🇺🇸 (2025.03) Guest lecture on Inverse RL Meets LLMs at the UCLA Reinforcement Learning course.

🇺🇸 (2025.02) Attending AAAI 2025 to run the Tutorial: Inverse RL Meets LLMs. Thanks for joining us in Philadelphia! Slide.

📄 (2025.02) Our Reward Model Paper Part II: Active Reward Modeling is online.

📄 (2025.01) Our Reward Model Paper Part I: Foundation, Theory, and Alternatives is accepted by ICLR as an Oral 🎉. It is an amazing experience to work with Yunyi and Jef.