Sotabase
Home
Researchers
Career
·
Advisor
,
Scale AI
2024–
·
Executive Director
,
Center for AI Safety
2023–
·
Advisor
,
xAI
2023–
·
Research Intern
,
DeepMind
2019–2019
Publications
(102)
Measuring Massive Multitask Language Understanding
International Conference on Learning Representations · 2020
6,871
cited
Gaussian Error Linear Units (GELUs)
2016
6,235
cited
Measuring Mathematical Problem Solving With the MATH Dataset
NeurIPS Datasets and Benchmarks · 2021
4,229
cited
Benchmarking Neural Network Robustness to Common Corruptions and Perturbations
International Conference on Learning Representations · 2019
4,018
cited
A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks
International Conference on Learning Representations · 2016
3,971
cited
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
arXiv.org · 2022
2,201
cited
The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
IEEE International Conference on Computer Vision · 2020
2,143
cited
Natural Adversarial Examples
Computer Vision and Pattern Recognition · 2019
1,768
cited
Deep Anomaly Detection with Outlier Exposure
International Conference on Learning Representations · 2018
1,663
cited
AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty
International Conference on Learning Representations · 2019
1,506
cited
Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty
Neural Information Processing Systems · 2019
1,033
cited
Measuring Coding Challenge Competence With APPS
NeurIPS Datasets and Benchmarks · 2021
940
cited
Using Pre-Training Can Improve Model Robustness and Uncertainty
International Conference on Machine Learning · 2019
800
cited
Aligning AI With Shared Human Values
International Conference on Learning Representations · 2020
784
cited
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
International Conference on Machine Learning · 2024
781
cited
Bridging Nonlinearities and Stochastic Regularizers with Gaussian Error Linear Units
arXiv.org · 2016
773
cited
Representation Engineering: A Top-Down Approach to AI Transparency
arXiv.org · 2023
739
cited
Scaling Out-of-Distribution Detection for Real-World Settings
International Conference on Machine Learning · 2022
603
cited
Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise
Neural Information Processing Systems · 2018
594
cited
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
Neural Information Processing Systems · 2023
568
cited
Show all 102 papers →
Sotabase
Dan Hendrycks | Researcher Profile | Sotabase | Sotabase