Ryan Sullivan, Kaiwen Wang, Rahul Kidambi, Ryan Sullivan, Alekh Agarwal, Christoph Dann, Andrea Michi, Marco Gelmi, Yunxuan Li, Raghav Gupta, Kumar Dubey
(2024).
Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning.
In
EMNLP 2024.
Joseph Suarez, David Bloomin, Kyoung Whan Choe, Hao Xiang Li, Ryan Sullivan, Nishaanth Kanna Ravichandran, Daniel Scott, Rose S Shuman, Herbie Bradley, Louis Castricato, Phillip Isola, Kirsty You, Yuhao Jiang, Qimai Li, Jiaxin Chen, Xiaolong Zhu
(2023).
Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning.
In
NeurIPS 2023.
J. K. Terry, Benjamin Black, Nathaniel Grammel, Mario Jayakumar, Ananth Hari, Ryan Sullivan, Luis Santos, Rodriguez Perez, Caroline Horsch, Clemens Dieffendahl, Niall L. Williams, Yashas Lokesh, Praveen Ravi
(2021).
PettingZoo: A Standard API for Multi-Agent Reinforcement Learning.
In
NeurIPS 2021.