Sotabase
Home
Researchers
Career
·
Postdoctoral Researcher
,
MIT Robot Locomotion Group
2022–
·
Ph.D. candidate, EECS
,
MIT Robot Locomotion Group
Publications
(129)
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Handbook of Reinforcement Learning and Control · 2019
1,494
cited
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
International Conference on Machine Learning · 2018
651
cited
Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes
Neural Information Processing Systems · 2020
223
cited
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
SIAM Journal of Control and Optimization · 2019
211
cited
Learning Safe Multi-Agent Control with Decentralized Neural Barrier Certificates
International Conference on Learning Representations · 2021
154
cited
Policy Optimization for H2 Linear Control with H∞ Robustness Guarantee: Implicit Regularization and Global Convergence
SIAM Journal of Control and Optimization · 2019
133
cited
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
Neural Information Processing Systems · 2020
132
cited
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
Neural Information Processing Systems · 2019
132
cited
An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods
Neural Information Processing Systems · 2022
121
cited
Do Differentiable Simulators Give Better Policy Gradients?
International Conference on Machine Learning · 2022
118
cited
Topology optimization considering overhang constraint in additive manufacturing
Computers & structures · 2019
118
cited
Robust Multi-Agent Reinforcement Learning with Model Uncertainty
Neural Information Processing Systems · 2020
101
cited
Decentralized Q-Learning in Zero-sum Markov Games
Neural Information Processing Systems · 2021
95
cited
Networked Multi-Agent Reinforcement Learning in Continuous Spaces
IEEE Conference on Decision and Control · 2018
90
cited
Decentralized Policy Gradient Descent Ascent for Safe Multi-Agent Reinforcement Learning
AAAI Conference on Artificial Intelligence · 2021
88
cited
Decentralized multi-agent reinforcement learning with networked agents: recent advances
Frontiers of Information Technology & Electronic Engineering · 2019
83
cited
Dependency Analysis and Improved Parameter Estimation for Dynamic Composite Load Modeling
IEEE Transactions on Power Systems · 2017
79
cited
Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence
International Conference on Machine Learning · 2022
79
cited
Communication-Efficient Policy Gradient Methods for Distributed Reinforcement Learning
IEEE Transactions on Control of Network Systems · 2022
76
cited
A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning
IFAC-PapersOnLine · 2019
71
cited
Show all 129 papers →
Sotabase
Kaiqing Zhang | Researcher Profile | Sotabase | Sotabase