Liwei Jiang | 姜力炜 Ph.D. Candidate
Paul G. Allen School of Computer Science & Engineering, University of Washington
I am a final-year Ph.D. candidate at the Paul G. Allen School of Computer Science & Engineering, University of Washington, advised by Prof. Yejin Choi. I am also a graduate student researcher at NVIDIA and was previously a student researcher at the Allen Institute for Artificial Intelligence (Ai2).
My research centers on Humanistic, Pluralistic, and Coevolutionary AI Safety and Alignment, aiming to foster the long-term secure, sustainable, and synergistic coevolution of AI and humanity:
From Human to AI: Developing human-centered, ever-evolving, and future-oriented AI systems, anchored in interdisciplinary insights into human intelligence, values, and global needs.
From AI to Human: Advancing the frontiers of human knowledge, augmenting human capabilities, and addressing consequential sociotechnical challenges through robust, efficient, and scalable innovations in data, learning algorithms, and AI system design.
My current research focuses on developing data-, algorithm-, and system-level solutions to sociotechnical challenges in AI safety, security, and LLM alignment, often through multi-agent, RL, and data synthesis angles. My work has spearheaded research on moral and (pluralistic) value reasoning in LLMs. An overview of my research:
News
Publications (*, + indicate equal contribution) Google Scholar
Awards
🏆 Best Paper Award
🏆 Outstanding Paper Award
🏆 Best Paper Award
🏆 Outstanding Paper Award
🏆 Best Paper Award
Anne Dinning - Michael Wolf Endowed Regental Fellowship
Member of the Phi Beta Kappa Society
Honorable Mention of Interdisciplinary Contest in Modeling (ICM)
Phi Beta Kappa Undergraduate Scholastic Achievement Award
Julius Seelye Bixler Scholar
Phi Beta Kappa Summer Research Scholar
Education & Experience
Education
University of Washington
Colby College
Professional Experience
NVIDIA
Allen Institute for Artificial Intelligence (Ai2)
Stanford University
The Future Laboratory, Tsinghua University
Teaching & Services
Courses
CSE447/517 Natural Language Processing—An LLM Version (Spring 2024, Grad + Undergrad. Instructed by Prof. Yejin Choi)
CSE 599 D1 Exploration on Language, Knowledge, and Reasoning (Winter 2023, Grad, Instructed by Prof. Yejin Choi)
Conference Tutorials
Guardrails and Security for LLMs: Safe, Secure, and Controllable Steering of LLM Applications
Guest Lectures
CSE 447: Natural Language Processing, University of Washington
COM SCI 162: Natural Language Processing, UCLA
11-830: Ethics, Social Biases, and Positive Impact in Language Technologies, CMU
IS504: Sociotechnical Information Systems, UIUC
CS475: ML for NLP, KAIST, South Korea
CSE 447: Natural Language Processing, University of Washington
CS1684/2084: Bias and Ethical Implications in Artificial Intelligence, University of Pittsburgh
CSE 163: Intermediate Data Programming, University of Washington
Ethics and Citizenship, The Downtown School, Seattle
CS496: AI Perspectives: Symbolic Reasoning to Deep Learning, Northwestern University
LAW E 553: Technology Law And Public Policy Seminar, University of Washington
Ethics and Citizenship, The Downtown School, Seattle
HONORS 222 B: Artificial Intelligence Meets Society, University of Washington
Workshop Organizations
3rd Edition of Socially Responsible Language Modelling Research (SoLaR)
2nd Edition of Socially Responsible Language Modelling Research (SoLaR)
AI Meets Moral Philosophy and Moral Psychology: An Interdisciplinary Dialogue about Computational Ethics (MP2)
Talks
UCLA NLP Seminar | UIUC NLP Seminar | UIUC ECE (Hosted by Prof. Huan Zhang)
NeurIPS 2025 | NVIDIA | Ploutos | University of Toronto (Hosted by Prof. Ebrahim Bagheri) | Zhiyuan Talk | AI TIME
Netskope
DARPA ITM PI Meeting
University of Washington, Foster School of Business, Computational Minds and Machines lab
Annual Research Showcase and Open House Event, UW CSE
All-Ai2 Meeting, Allen Institute for Artificial Intelligence (Ai2)
The Big Picture Workshop, EMNLP, Singapore
DARPA ITM Kickoff PI Meeting
Mosaic Morality & AI Series, Allen Institute for Artificial Intelligence (Ai2)
UW NLP Retreat
All-Ai2 Meeting, Allen Institute for Artificial Intelligence (Ai2)
Bio
Liwei Jiang is a final-year Ph.D. candidate in the Paul G. Allen School of Computer Science & Engineering at the University of Washington, advised by Prof. Yejin Choi. She was previously a graduate student researcher at NVIDIA and the Allen Institute for Artificial Intelligence (Ai2). Her research focuses on humanistic, pluralistic, and coevolutionary AI safety and alignment, where she spearheads research on moral and pluralistic value reasoning in language models and develops data-, algorithm-, and system-level solutions to socio-technical challenges in AI safety, security, and large language model alignment. Her work has received Best Paper Awards at NeurIPS 2025, NAACL 2022, and CHI 2024, as well as Outstanding Paper Awards at EMNLP 2023 and the AIA Workshop at COLM 2025, and has been featured in The New York Times, Nature Outlook, IEEE Spectrum, Wired, and other major media. She co-organizes workshops including MP2 (NeurIPS 2023) and SoLaR (NeurIPS 2024; COLM 2025), and co-leads the Guardrails and Security for LLMs tutorial at ACL 2025.
姜力炜是华盛顿大学保罗·艾伦计算机科学与工程学院(Paul G. Allen School of Computer Science & Engineering at University of Washington)的博士生,师从Yejin Choi教授。她曾是英伟达(NVIDIA)和艾伦人工智能研究所(Ai2)的研究生研究员。她的研究专注于以人为本、多元化和协同演化的人工智能安全与对齐,在语言模型的道德和多元价值推理方面开展了开创性研究,并开发了数据、算法和系统级解决方案,以应对人工智能安全、安保和大语言模型对齐中的社会技术挑战。她的工作获得了NeurIPS 2025、NAACL 2022和CHI 2024的最佳论文奖,以及EMNLP 2023和COLM 2025 AIA研讨会的杰出论文奖,并被《纽约时报》、《Nature Outlook》、《IEEE Spectrum》、《Wired》等主流媒体报道。她共同组织了MP2(NeurIPS 2023)和SoLaR(NeurIPS 2024; COLM 2025)研讨会,并共同主持ACL 2025的《大语言模型的护栏与安全》教程。
Personal
I deeply value mentorship and am profoundly grateful to the mentors who have shaped and supported my research journey (in alphabetical order): Chandra Bhagavatula, Antoine Bosselut, Yejin Choi, Oren Etzioni, Erick Galinkin, Jena D. Hwang, Natasha Jaques, James Landay, Ronan Le Bras, Christopher Parisien, Sherry Ruan, Maarten Sap, and Yulia Tsvetkov.
I firmly believe that everyone has the potential to achieve anything they set their mind to. Keep going and try again.
Your path is uniquely yours. Follow what ignites you. Every twist, every turn, every unexpected direction is exactly where you need to be.
Two cats, an orange tabby named Loopy and an orange British Shorthair named Loafy, adopted me as their owner.
My forever role model: RBG (Ruth Bader Ginsburg).