Sotabase
Career
· PiTech Fellow, Cornell University, 2024–
· PhD Student, Cornell University, 2022–
· Instructor, Princeton University, 2021–
· Undergraduate Student, Princeton University, 2018–2022
Publications (14)
Mitigating dataset harms requires stewardship: Lessons from 1000 papers
NeurIPS Datasets and Benchmarks · 2021 · 106 cited
REFORMS: Consensus-based Recommendations for Machine-learning-based Science
Science Advances · 2024 · 64 cited
REFORMS: Reporting Standards for Machine Learning Based Science
arXiv.org · 2023 · 27 cited
Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers
North American Chapter of the Association for Computational Linguistics · 2024 · 24 cited
Sparse Autoencoders for Hypothesis Generation
International Conference on Machine Learning · 2025 · 16 cited
Reconciling the Accuracy-Diversity Trade-off in Recommendations
The Web Conference · 2023 · 15 cited
Use Sparse Autoencoders to Discover Unknown Concepts, Not to Act on Known Concepts
arXiv.org · 2025 · 12 cited
Monoculture in Matching Markets
Neural Information Processing Systems · 2023 · 10 cited
Correlated Errors in Large Language Models
International Conference on Machine Learning · 2025 · 7 cited
Wisdom and Foolishness of Noisy Matching Markets
ACM Conference on Economics and Computation · 2024 · 6 cited
A No Free Lunch Theorem for Human-AI Collaboration
AAAI Conference on Artificial Intelligence · 2024 · 5 cited
Mixing times of one-sided $k$-transposition shuffles
2021 · 3 cited
Checklist for reporting ML-based science
2023
Kenny Peng | Researcher Profile | Sotabase