Archive
A one-stop shop for all posts from the Blog, Monthly Music, and Projects.
2025
- Efficient Reinforcement Finetuning via Adaptive Curriculum Learning (papers)
- Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base (papers)
- On the Trustworthiness of Generative Foundation Models (papers)
- Detecting and Filtering Unsafe Training Data via Data Attribution (papers)
2024
- WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback (papers)
- How Susceptible are Large Language Models to Ideological Manipulation? (papers)
2023
- Can Language Model Moderators Improve the Health of Online Discourse? (papers)
- Safer-Instruct: Aligning Language Models with Automated Preference Data (papers)
- CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation (papers)
- Positive Reframing Keyboard (projects)
- Neural Story Planning (papers)
- Investigating AAVE in Question Answering Systems (papers)
- 2016 Year In Review (blog)