Sotabase
Career
Graduate Fellow, UC Berkeley - Kavli Center for Ethics, Science, and the Public · 2024–
Publications (8)
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Trans. Mach. Learn. Res. · 2023 · 738 cited
Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF
International Conference on Learning Representations · 2023 · 96 cited
PERSONA: A Reproducible Testbed for Pluralistic Alignment
International Conference on Computational Linguistics · 2024 · 50 cited
AI Alignment with Changing and Influenceable Reward Functions
International Conference on Machine Learning · 2024 · 43 cited
Analyzing Human Models that Adapt Online
IEEE International Conference on Robotics and Automation · 2021 · 23 cited
Understanding Hidden Context in Preference Learning: Consequences for RLHF
10 cited
On the Computational Consequences of Cost Function Design in Nonlinear Optimal Control
IEEE Conference on Decision and Control · 2022 · 1 cited
Anand Siththaranjan | Researcher Profile | Sotabase