Sotabase
Career
· Assistant Professor, University of California, Irvine, 2019–
· Postdoctoral Researcher, Berkeley AI Research (BAIR), 2016–
· Research Intern, UC Berkeley, AUTOLAB, 2016–
Publications (62)
RLlib: Abstractions for Distributed Reinforcement Learning
International Conference on Machine Learning · 2017
981 citations
Taming the Noise in Reinforcement Learning via Soft Updates
Conference on Uncertainty in Artificial Intelligence · 2015
361 citations
DART: Noise Injection for Robust Imitation Learning
Conference on Robot Learning · 2017
289 citations
Ray RLLib: A Composable and Scalable Reinforcement Learning Library
Neural Information Processing Systems · 2017
178 citations
Multi-Level Discovery of Deep Options
arXiv.org · 2017
131 citations
Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
International Conference on Machine Learning · 2023
106 citations
DDCO: Discovery of Deep Continuous Options for Robot Learning from Demonstrations
Conference on Robot Learning · 2017
88 citations
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
Neural Information Processing Systems · 2020
88 citations
AutoPandas: neural-backed generators for program synthesis
Proc. ACM Program. Lang. · 2019
85 citations
Fast and Reliable Autonomous Surgical Debridement with Cable-Driven Robots Using a Two-Phase Calibration Procedure
IEEE International Conference on Robotics and Automation · 2017
74 citations
XDO: A Double Oracle Algorithm for Extensive-Form Games
Neural Information Processing Systems · 2021
58 citations
Independent Natural Policy Gradient Always Converges in Markov Potential Games
International Conference on Artificial Intelligence and Statistics · 2021
54 citations
Parametrized Hierarchical Procedures for Neural Programming
International Conference on Learning Representations · 2018
30 citations
Multi-Task Hierarchical Imitation Learning for Home Automation
2019 IEEE 15th International Conference on Automation Science and Engineering (CASE) · 2019
29 citations
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
International Conference on Machine Learning · 2022
25 citations
A multi-agent control framework for co-adaptation in brain-computer interfaces
Neural Information Processing Systems · 2013
24 citations
Iterative Noise Injection for Scalable Imitation Learning
arXiv.org · 2017
22 citations
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
arXiv.org · 2022
21 citations
Minimum-information LQG control part I: Memoryless controllers
IEEE Conference on Decision and Control · 2016
18 citations
Target Entropy Annealing for Discrete Soft Actor-Critic
arXiv.org · 2021
18 citations
Roy Fox | Researcher Profile | Sotabase