Projects

Things I do, including research, academic course projects, and miscellaneous interests.

Research

Research publications for fans of natural language processing, computational social science, and machine learning.

A benchmark assessing the steerability of large language models using Reddit communities across 30 subreddit pairs in 19 domains.

EMNLP, 2025

Enabling LLMs to Reason About Uncertainty

EMNLP Findings, 2025

Introducing computer-using agents with coding as actions, a novel paradigm for task automation.

Preprint, 2025

Efficient Reinforcement Finetuning via Adaptive Curriculum Learning

Preprint, 2025

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base

COLM, 2025

On the Trustworthiness of Generative Foundation Models: Guideline, Benchmark, and Perspective

Preprint, 2025

Detecting and Filtering Unsafe Training Data via Data Attribution

Preprint, 2025

Aligning LLMs With In-situ User Interactions And Feedback

Preprint, 2024

Exposure to only a small amount of ideologically driven samples significantly alters the ideology of LLMs

EMNLP, 2024

Can Language Model Moderators Improve the Health of Online Discourse?

NAACL, 2024

Safer-Instruct Aligning Language Models with Automated Preference Data

NAACL, 2024

Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation

EMNLP, 2023

Combining symbolic and neural story generation

AAAI Creative AI Workshop, 2023

GT Thesis, 2023