Taiwei Shi

Projects

Things I do, including research, academic course projects, and miscellaneous interests.

Research

Research publications for fans of natural language processing, computational social science, and machine learning.

On the Trustworthiness of Generative Foundation Models: Guideline, Benchmark, and Perspective
Preprint, 2025
Detecting and Filtering Unsafe Training Data via Data Attribution
Preprint, 2025
Aligning LLMs With In-situ User Interactions And Feedback
Preprint, 2024
Exposure to only a small amount of ideologically driven samples significantly alters the ideology of LLMs
EMNLP, 2024
Can Language Model Moderators Improve the Health of Online Discourse?
NAACL, 2024
Safer-Instruct Aligning Language Models with Automated Preference Data
NAACL, 2024
Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation
EMNLP, 2023
Combining symbolic and neural story generation
AAAI Creative AI Workshop, 2023
Investigating AAVE in Question Answering Systems
GT Thesis, 2023