Sotabase
Career
· PhD Student, University of Oxford, 2015–
· Master's degree, Physics, 1st, University of Oxford, 2011–2015
Publications (28)
Counterfactual Multi-Agent Policy Gradients
AAAI Conference on Artificial Intelligence · 2017 · 2,378 cited
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
International Conference on Machine Learning · 2018 · 1,887 cited
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Journal of Machine Learning Research · 2020 · 1,190 cited
The StarCraft Multi-Agent Challenge
Adaptive Agents and Multi-Agent Systems · 2019 · 1,138 cited
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
International Conference on Machine Learning · 2017 · 647 cited
Weighted QMIX: Expanding Monotonic Value Function Factorisation
Neural Information Processing Systems · 2020 · 433 cited
A Survey of Reinforcement Learning Informed by Natural Language
International Joint Conference on Artificial Intelligence · 2019 · 304 cited
Multi-Agent Common Knowledge Reinforcement Learning
Neural Information Processing Systems · 2018 · 117 cited
DiCE: The Infinitely Differentiable Monte-Carlo Estimator
International Conference on Machine Learning · 2018 · 104 cited
Transient Non-stationarity and Generalisation in Deep Reinforcement Learning
International Conference on Learning Representations · 2020 · 98 cited
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning
International Conference on Learning Representations · 2017 · 72 cited
TreeQN and ATreeC: Differentiable Tree Planning for Deep Reinforcement Learning
International Conference on Learning Representations · 2017 · 71 cited
Growing Action Spaces
International Conference on Machine Learning · 2019 · 47 cited
Proper Value Equivalence
Neural Information Processing Systems · 2021 · 38 cited
The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
arXiv.org · 2020 · 35 cited
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
International Conference on Machine Learning · 2021 · 27 cited
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design
Neural Information Processing Systems · 2023 · 19 cited
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning
Neural Information Processing Systems · 2019 · 18 cited
A Baseline for Any Order Gradient Estimation in Stochastic Computation Graphs
International Conference on Machine Learning · 2019 · 13 cited
Self-Consistent Models and Values
Neural Information Processing Systems · 2021 · 8 cited
Greg Farquhar | Researcher Profile | Sotabase