Anna Chen | Researcher Profile | Sotabase

Career

· Computational Astrophysics Researcher, Bowdoin College2024–2024

· Physics Learning Assistant, Bowdoin College2024–

· Research Assistant, Center for Astrophysics | Harvard & Smithsonian2023–2023

Publications (55)

Language Models (Mostly) Know What They Know

arXiv.org · 2022

1,204

cited

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

arXiv.org · 2022

656

cited

Discovering Language Model Behaviors with Model-Written Evaluations

Annual Meeting of the Association for Computational Linguistics · 2022

621

cited

Spatially Ordered Dynamics of the Bacterial Carbon Fixation Machinery

Science · 2010

328

cited

Measuring Faithfulness in Chain-of-Thought Reasoning

arXiv.org · 2023

321

cited

Designing biological compartmentalization.

Trends in Cell Biology · 2012

255

cited

The Capacity for Moral Self-Correction in Large Language Models

arXiv.org · 2023

195

cited

Measuring Progress on Scalable Oversight for Large Language Models

arXiv.org · 2022

177

cited

The Bacterial Carbon-Fixing Organelle Is Formed by Shell Envelopment of Preassembled Cargo

PLoS ONE · 2013

132

cited

Question Decomposition Improves the Faithfulness of Model-Generated Reasoning

arXiv.org · 2023

108

cited

Spatial and Temporal Organization of Chromosome Duplication and Segregation in the Cyanobacterium Synechococcus elongatus PCC 7942

PLoS ONE · 2012

cited

Specific versus General Principles for Constitutional AI

arXiv.org · 2023

cited

Transplantability of a circadian clock to a noncircadian organism

Science Advances · 2015

cited

DeCoT: Debiasing Chain-of-Thought for Knowledge-Intensive Tasks in Large Language Models via Causal Intervention

Annual Meeting of the Association for Computational Linguistics · 2024

cited

Modeling the Tumor Microenvironment and Cancer Immunotherapy in Next-Generation Humanized Mice

Cancers · 2023

cited

HuatuoGPT, Towards Taming Language Models To Be a Doctor

cited

SHADES: Towards a Multilingual Assessment of Stereotypes in Large Language Models

North American Chapter of the Association for Computational Linguistics · 2025

cited

Discovery and characterization of tumor antigens in hepatocellular carcinoma for mRNA vaccine development

Journal of Cancer Research and Clinical Oncology · 2022

cited

Ask Again, Then Fail: Large Language Models’ Vacillations in Judgment

Volume 1 · 2024

cited

Prototypical Reward Network for Data Efficient Model Alignment

cited

Sotabase