Sotabase
Career
· Computational Astrophysics Researcher, Bowdoin College, 2024–2024
· Physics Learning Assistant, Bowdoin College, 2024–
· Research Assistant, Center for Astrophysics | Harvard & Smithsonian, 2023–2023
Publications (55)
Language Models (Mostly) Know What They Know
arXiv.org · 2022 · 1,204 citations
Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned
arXiv.org · 2022 · 656 citations
Discovering Language Model Behaviors with Model-Written Evaluations
Annual Meeting of the Association for Computational Linguistics · 2022 · 621 citations
Spatially Ordered Dynamics of the Bacterial Carbon Fixation Machinery
Science · 2010 · 328 citations
Measuring Faithfulness in Chain-of-Thought Reasoning
arXiv.org · 2023 · 321 citations
Designing biological compartmentalization
Trends in Cell Biology · 2012 · 255 citations
The Capacity for Moral Self-Correction in Large Language Models
arXiv.org · 2023 · 195 citations
Measuring Progress on Scalable Oversight for Large Language Models
arXiv.org · 2022 · 177 citations
The Bacterial Carbon-Fixing Organelle Is Formed by Shell Envelopment of Preassembled Cargo
PLoS ONE · 2013 · 132 citations
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning
arXiv.org · 2023 · 108 citations
Spatial and Temporal Organization of Chromosome Duplication and Segregation in the Cyanobacterium Synechococcus elongatus PCC 7942
PLoS ONE · 2012 · 59 citations
Specific versus General Principles for Constitutional AI
arXiv.org · 2023 · 44 citations
Transplantability of a circadian clock to a noncircadian organism
Science Advances · 2015 · 32 citations
DeCoT: Debiasing Chain-of-Thought for Knowledge-Intensive Tasks in Large Language Models via Causal Intervention
Annual Meeting of the Association for Computational Linguistics · 2024 · 26 citations
Modeling the Tumor Microenvironment and Cancer Immunotherapy in Next-Generation Humanized Mice
Cancers · 2023 · 24 citations
HuatuoGPT, Towards Taming Language Models To Be a Doctor
21 citations
SHADES: Towards a Multilingual Assessment of Stereotypes in Large Language Models
North American Chapter of the Association for Computational Linguistics · 2025 · 12 citations
Discovery and characterization of tumor antigens in hepatocellular carcinoma for mRNA vaccine development
Journal of Cancer Research and Clinical Oncology · 2022 · 7 citations
Ask Again, Then Fail: Large Language Models’ Vacillations in Judgment
Volume 1 · 2024 · 6 citations
Prototypical Reward Network for Data Efficient Model Alignment
5 citations
Anna Chen | Researcher Profile | Sotabase