Sotabase
Home
Researchers
Career
·
Associate Professor
,
University of Toronto
2023–
Publications
(146)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Journal of machine learning research · 2019
24,173
cited
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence
Neural Information Processing Systems · 2020
4,348
cited
MixMatch: A Holistic Approach to Semi-Supervised Learning
Neural Information Processing Systems · 2019
3,391
cited
Emergent Abilities of Large Language Models
Trans. Mach. Learn. Res. · 2022
3,200
cited
mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer
North American Chapter of the Association for Computational Linguistics · 2020
2,986
cited
librosa: Audio and Music Signal Analysis in Python
SciPy · 2015
2,821
cited
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
arXiv.org · 2022
2,792
cited
Extracting Training Data from Large Language Models
USENIX Security Symposium · 2020
2,566
cited
Theano: A Python framework for fast computation of mathematical expressions
arXiv.org · 2016
2,366
cited
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
arXiv.org · 2022
2,201
cited
Multitask Prompted Training Enables Zero-Shot Task Generalization
arXiv.org · 2021
1,920
cited
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning
Neural Information Processing Systems · 2022
1,197
cited
How Much Knowledge Can You Pack into the Parameters of a Language Model?
Conference on Empirical Methods in Natural Language Processing · 2020
1,000
cited
Realistic Evaluation of Semi-Supervised Learning Algorithms
International Conference on Learning Representations · 2018
876
cited
ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring
arXiv.org · 2019
752
cited
Thermometer Encoding: One Hot Way To Resist Adversarial Examples
International Conference on Learning Representations · 2018
639
cited
ByT5: Towards a Token-Free Future with Pre-trained Byte-to-Byte Models
Transactions of the Association for Computational Linguistics · 2021
612
cited
MIR_EVAL: A Transparent Implementation of Common MIR Metrics
International Society for Music Information Retrieval Conference · 2014
606
cited
Large Language Models Struggle to Learn Long-Tail Knowledge
International Conference on Machine Learning · 2022
561
cited
Crosslingual Generalization through Multitask Finetuning
Annual Meeting of the Association for Computational Linguistics · 2023
556
cited
Show all 146 papers →
Sotabase