Dan Hendrycks | Researcher Profile | Sotabase

Career

· Advisor, Scale AI, 2024–

· Executive Director, Center for AI Safety, 2023–

· Advisor, xAI, 2023–

· Research Intern, DeepMind, 2019

Publications (102)

Measuring Massive Multitask Language Understanding
International Conference on Learning Representations · 2020 · 6,871 citations

Gaussian Error Linear Units (GELUs)
2016 · 6,235 citations

Measuring Mathematical Problem Solving With the MATH Dataset
NeurIPS Datasets and Benchmarks · 2021 · 4,229 citations

Benchmarking Neural Network Robustness to Common Corruptions and Perturbations
International Conference on Learning Representations · 2019 · 4,018 citations

A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks
International Conference on Learning Representations · 2016 · 3,971 citations

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
arXiv.org · 2022 · 2,201 citations

The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
IEEE International Conference on Computer Vision · 2020 · 2,143 citations

Natural Adversarial Examples
Computer Vision and Pattern Recognition · 2019 · 1,768 citations

Deep Anomaly Detection with Outlier Exposure
International Conference on Learning Representations · 2018 · 1,663 citations

AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty
International Conference on Learning Representations · 2019 · 1,506 citations

Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty
Neural Information Processing Systems · 2019 · 1,033 citations

Measuring Coding Challenge Competence With APPS
NeurIPS Datasets and Benchmarks · 2021 · 940 citations

Using Pre-Training Can Improve Model Robustness and Uncertainty
International Conference on Machine Learning · 2019 · 800 citations

Aligning AI With Shared Human Values
International Conference on Learning Representations · 2020 · 784 citations

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
International Conference on Machine Learning · 2024 · 781 citations

Bridging Nonlinearities and Stochastic Regularizers with Gaussian Error Linear Units
arXiv.org · 2016 · 773 citations

Representation Engineering: A Top-Down Approach to AI Transparency
arXiv.org · 2023 · 739 citations

Scaling Out-of-Distribution Detection for Real-World Settings
International Conference on Machine Learning · 2022 · 603 citations

Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise
Neural Information Processing Systems · 2018 · 594 citations

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
Neural Information Processing Systems · 2023 · 568 citations
