Greg Farquhar | Researcher Profile | Sotabase

Career

· PhD Student, University of Oxford, 2015–

· Master's degree, Physics, 1st, University of Oxford, 2011–2015

Publications (28)

Counterfactual Multi-Agent Policy Gradients

AAAI Conference on Artificial Intelligence · 2017

2,378 citations

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

International Conference on Machine Learning · 2018

1,887 citations

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Journal of Machine Learning Research · 2020

1,190 citations

The StarCraft Multi-Agent Challenge

Adaptive Agents and Multi-Agent Systems · 2019

1,138 citations

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

International Conference on Machine Learning · 2017

647 citations

Weighted QMIX: Expanding Monotonic Value Function Factorisation

Neural Information Processing Systems · 2020

433 citations

A Survey of Reinforcement Learning Informed by Natural Language

International Joint Conference on Artificial Intelligence · 2019

304 citations

Multi-Agent Common Knowledge Reinforcement Learning

Neural Information Processing Systems · 2018

117 citations

DiCE: The Infinitely Differentiable Monte-Carlo Estimator

International Conference on Machine Learning · 2018

104 citations

Transient Non-stationarity and Generalisation in Deep Reinforcement Learning

International Conference on Learning Representations · 2020

cited

TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning

International Conference on Learning Representations · 2017

cited

TreeQN and ATreeC: Differentiable Tree Planning for Deep Reinforcement Learning

International Conference on Learning Representations · 2017

cited

Growing Action Spaces

International Conference on Machine Learning · 2019

cited

Proper Value Equivalence

Neural Information Processing Systems · 2021

cited

The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning

arXiv.org · 2020

cited

PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning

International Conference on Machine Learning · 2021

cited

Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design

Neural Information Processing Systems · 2023

cited

Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning

Neural Information Processing Systems · 2019

cited

A Baseline for Any Order Gradient Estimation in Stochastic Computation Graphs

International Conference on Machine Learning · 2019

cited

Self-Consistent Models and Values

Neural Information Processing Systems · 2021

cited
