Ryan Sullivan

Computer Science PhD Candidate
rsulli@umd.edu

University of Maryland

I’m a 5th year PhD candidate at the University of Maryland studying reinforcement learning in open-ended environments. My research centers around developing an empirical understanding of policy gradient algorithms and automatic curriculum learning methods. I was one of the developers of PettingZoo for multiagent environments and have worked on a number of open source tools for RL, including my curriculum learning library Syllabus. I’ve interned at Amazon Science, Google Research, and Sony AI working on RL and RLHF. I’m very interested in research directions that use LLMs to augment RL training in complex environments, or RL to improve the general capabilities of LLMs. Feel free to reach out if you’d like to discuss ideas or opportunities to collaborate! I’m looking for industry research scientist or postdoc positions starting in August 2025.

Interests

Deep Reinforcement Learning
Automatic Curriculum Learning
Open-Endedness
RLHF/RLAIF

Education

PhD in Computer Science, Expected 2025
University of Maryland
BSc in Computer Science, 2020
Purdue University
BSc in Applied Statistics, 2020
Purdue University
BSc in Mathematics, 2020
Purdue University

Publications

Ryan Sullivan, Raghav Gupta, Ryan Sullivan, Yunxuan Li, Samrat Phatale, Abhinav Rastogi

February, 2025 In AAAI 2025

Robust Multi-Objective Preference Alignment with Online DPO

We introduce MO-ODPO, an efficient and robust algorithm for aligning large language models with multiple conflicting preferences, allowing flexible steerability at inference.

Ryan Sullivan, Ryan Sullivan, Ryan Pégoud, Ameen Ur Rahmen, Xinchen Yang, Junyun Huang, Aayush Verma, Nistha Mitra, John P. Dickerson

November, 2024 arXiv preprint

Syllabus: Portable Curricula for Reinforcement Learning Agents

Syllabus provides a portable library and universal API for implementing curriculum learning methods in reinforcement learning across diverse environments and RL libraries.

Ryan Sullivan, Kaiwen Wang, Rahul Kidambi, Ryan Sullivan, Alekh Agarwal, Christoph Dann, Andrea Michi, Marco Gelmi, Yunxuan Li, Raghav Gupta, Kumar Dubey

June, 2024 In EMNLP 2024

Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning

We propose Conditional Language Policy (CLP), a framework for finetuning steerable language models that effectively balance multiple conflicting objectives without maintaining separate models.

Ryan Sullivan, Akarsh Kumar, Shengyi Huang, John P. Dickerson, Joseph Suarez

December, 2023 In NeurIPS 2023

Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks

We applied the implementation tricks introduced by DreamerV3 to PPO, and identified cases where they help or harm reward robustness.

Joseph Suarez, David Bloomin, Kyoung Whan Choe, Hao Xiang Li, Ryan Sullivan, Nishaanth Kanna Ravichandran, Daniel Scott, Rose S Shuman, Herbie Bradley, Louis Castricato, Phillip Isola, Kirsty You, Yuhao Jiang, Qimai Li, Jiaxin Chen, Xiaolong Zhu

December, 2023 In NeurIPS 2023

Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning

The paper describes Neural MMO 2.0, a massive multitask update for the multiagent NeuralMMO environment.

Ryan Sullivan, J. K. Terry, Benjamin Black, John P. Dickerson

July, 2022 In ICML 2022

Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments

We developed a new visualization technique for reinforcement learning and used it to demonstrate a failure mode of policy gradient methods.

J. K. Terry, Benjamin Black, Nathaniel Grammel, Mario Jayakumar, Ananth Hari, Ryan Sullivan, Luis Santos, Rodriguez Perez, Caroline Horsch, Clemens Dieffendahl, Niall L. Williams, Yashas Lokesh, Praveen Ravi

October, 2021 In NeurIPS 2021

PettingZoo: A Standard API for Multi-Agent Reinforcement Learning

This paper introduces the PettingZoo library and the accompanying Agent Environment Cycle (“AEC”) games model.

Ryan Sullivan

Computer Science PhD Candidatersulli@umd.edu

University of Maryland

Publications

Computer Science PhD Candidate
rsulli@umd.edu