Sotabase
Home
Researchers
Career
·
Assistant Professor
,
University of Toronto
2011–
Publications
(408)
Dropout: a simple way to prevent neural networks from overfitting
Journal of machine learning research · 2014
42,312
cited
Supporting Online Material for Reducing the Dimensionality of Data with Neural Networks
2006
11,470
cited
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
International Conference on Machine Learning · 2015
10,623
cited
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Neural Information Processing Systems · 2019
9,140
cited
Improving neural networks by preventing co-adaptation of feature detectors
arXiv.org · 2012
7,937
cited
Probabilistic Matrix Factorization
Neural Information Processing Systems · 2007
4,746
cited
Transformer-XL: Attentive Language Models beyond a Fixed-Length Context
Annual Meeting of the Association for Computational Linguistics · 2019
4,166
cited
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
IEEE/ACM Transactions on Audio Speech and Language Processing · 2021
4,111
cited
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Conference on Empirical Methods in Natural Language Processing · 2018
3,782
cited
Human-level concept learning through probabilistic program induction
Science · 2015
3,218
cited
Deep Sets
2017
2,756
cited
Unsupervised Learning of Video Representations using LSTMs
International Conference on Machine Learning · 2015
2,691
cited
Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books
IEEE International Conference on Computer Vision · 2015
2,689
cited
Skip-Thought Vectors
Neural Information Processing Systems · 2015
2,470
cited
Revisiting Semi-Supervised Learning with Graph Embeddings
International Conference on Machine Learning · 2016
2,379
cited
Deep Boltzmann Machines
International Conference on Artificial Intelligence and Statistics · 2009
2,326
cited
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
arXiv.org · 2022
2,201
cited
Neighbourhood Components Analysis
Neural Information Processing Systems · 2004
2,120
cited
Restricted Boltzmann machines for collaborative filtering
International Conference on Machine Learning · 2007
2,076
cited
Multimodal Transformer for Unaligned Multimodal Language Sequences
Annual Meeting of the Association for Computational Linguistics · 2019
1,829
cited
Show all 408 papers →
Sotabase