Sotabase
Home
Researchers
Career
·
Research Scientist
,
DeepMind
·
Brain Residency Program (now AI Residency)
,
Google
·
TBD
,
Meta
·
MSc Student
,
MILA lab, Université de Montréal
·
PhD Student
,
University of California, Berkeley
·
Undergraduate Student
,
University of Toronto
Publications
(40)
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
arXiv.org · 2024
3,137
cited
Theano: A Python framework for fast computation of mathematical expressions
arXiv.org · 2016
2,366
cited
Probabilistic Model-Agnostic Meta-Learning
Neural Information Processing Systems · 2018
726
cited
Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples
International Conference on Learning Representations · 2019
673
cited
An Actor-Critic Algorithm for Sequence Prediction
International Conference on Learning Representations · 2016
661
cited
On Using Monolingual Corpora in Neural Machine Translation
arXiv.org · 2015
580
cited
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Neural Information Processing Systems · 2017
521
cited
Unsupervised Perceptual Rewards for Imitation Learning
Robotics: Science and Systems · 2016
162
cited
Small-scale proxies for large-scale Transformer training instabilities
International Conference on Learning Representations · 2023
142
cited
Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention
IEEE International Conference on Robotics and Automation · 2021
120
cited
On integrating a language model into neural machine translation
Computer Speech and Language · 2017
118
cited
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control
International Conference on Learning Representations · 2017
113
cited
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning
International Conference on Machine Learning · 2018
67
cited
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
International Conference on Machine Learning · 2023
66
cited
Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries
arXiv.org · 2024
46
cited
Autonomous Reinforcement Learning: Formalism and Benchmarking
International Conference on Learning Representations · 2021
35
cited
Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Neural Information Processing Systems · 2020
34
cited
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance
IEEE International Conference on Robotics and Automation · 2022
30
cited
Few-Shot Intent Inference via Meta-Inverse Reinforcement Learning
2018
5
cited
An Actor-Critic Algorithm for Structured Prediction
2016
3
cited
Show all 40 papers →
Sotabase