Kelvin Xu | Researcher Profile | Sotabase

Career

· Research Scientist, DeepMind

· Brain Residency Program (now AI Residency), Google

· TBD, Meta

· MSc Student, MILA lab, Université de Montréal

· PhD Student, University of California, Berkeley

· Undergraduate Student, University of Toronto

Publications (40)

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

arXiv.org · 2024

3,137

cited

Theano: A Python framework for fast computation of mathematical expressions

arXiv.org · 2016

2,366

cited

Probabilistic Model-Agnostic Meta-Learning

Neural Information Processing Systems · 2018

726

cited

Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples

International Conference on Learning Representations · 2019

673

cited

An Actor-Critic Algorithm for Sequence Prediction

International Conference on Learning Representations · 2016

661

cited

On Using Monolingual Corpora in Neural Machine Translation

arXiv.org · 2015

580

cited

Bridging the Gap Between Value and Policy Based Reinforcement Learning

Neural Information Processing Systems · 2017

521

cited

Unsupervised Perceptual Rewards for Imitation Learning

Robotics: Science and Systems · 2016

162

cited

Small-scale proxies for large-scale Transformer training instabilities

International Conference on Learning Representations · 2023

142

cited

Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention

IEEE International Conference on Robotics and Automation · 2021

120

cited

On integrating a language model into neural machine translation

Computer Speech and Language · 2017

118

cited

Trust-PCL: An Off-Policy Trust Region Method for Continuous Control

International Conference on Learning Representations · 2017

113

cited

Learning a Prior over Intent via Meta-Inverse Reinforcement Learning

International Conference on Machine Learning · 2018

cited

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models

International Conference on Machine Learning · 2023

cited

Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries

arXiv.org · 2024

cited

Autonomous Reinforcement Learning: Formalism and Benchmarking

International Conference on Learning Representations · 2021

cited

Continual Learning of Control Primitives: Skill Discovery via Reset-Games

Neural Information Processing Systems · 2020

cited

Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance

IEEE International Conference on Robotics and Automation · 2022

cited

Few-Shot Intent Inference via Meta-Inverse Reinforcement Learning

2018

cited

An Actor-Critic Algorithm for Structured Prediction

2016

cited

Sotabase