Publications

(2025). Robust Multi-Objective Preference Alignment with Online DPO. In AAAI 2025.

(2024). Syllabus: Portable Curricula for Reinforcement Learning Agents. arXiv preprint.

(2024). Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning. In EMNLP 2024.

(2023). Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks. In NeurIPS 2023.

(2023). Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning. In NeurIPS 2023.

(2023). Gradient Informed Proximal Policy Optimization. In NeurIPS 2023.

(2022). Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments. In ICML 2022.

(2021). PettingZoo: A Standard API for Multi-Agent Reinforcement Learning. In NeurIPS 2021.
