| CARVIEW |
Ximing Lu
I am a Ph.D. candidate at the University of Washington, advised by Professor Yejin Choi. Previously, I received my B.S. degree in Computer Science at University of Washington.
My broad research goal is to understand the boundaries of machine intelligence and bridge the capability gap between models and humans by exploring alternative paths beyond scaling, such as algorithmic innovations and knowledge enhancement. Over the past few years, I have focused on studying the capabilities and limits of language models, as well as developing learning and inference algorithms to unlock capabilities in smaller models, for example:
- I investigate the fundamental limits of Transformer language models in the context of compositional tasks in my work Faith and Fate. I explore the divergence in the configuration of machine and human intelligence by proposing and testing the Generative AI Paradox.
- I have worked to develop a suite of learning and decoding-time methods to empower compact and efficient language models, including NeuroLogic Decoding, NeuroLogic A*esque Decoding, Quark, and Inference-Time Policy Adapters.
Email: lux32 [at] cs.washington.edu
Links: [Google Scholar] [Twitter] [Github] [CV] [Research Statement]
Publications
Publications are listed in reverse chronological order. For a list of all publications, please check out my Google Scholar
-
Information-Theoretic Distillation for Reference-less Summarization
Jaehun Jung, Ximing Lu, Liwei Jiang, Faeze Brahman, Peter West, Pang Wei Koh, Yejin Choi
arXiv:2403.13780
-
A Roadmap to Pluralistic Alignment
Taylor Sorensen, Jared Moore, Jillian Fisher, Mitchell Gordon, Niloofar Mireshghallah, Christopher Michael Rytting, Andre Ye, Liwei Jiang, Ximing Lu, Nouha Dziri, Tim Althoff, Yejin Choi
ICML 2024
-
Impossible Distillation: From Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing
Jaehun Jung, Peter West, Liwei Jiang, Faeze Brahman, Ximing Lu, Jillian Fisher, Taylor Sorensen, Yejin Choi
NAACL 2024
-
JAMDEC: Unsupervised Authorship Obfuscation using Constrained Decoding over Small Language Models
Jillian Fisher, Ximing Lu, Jaehun Jung, Liwei Jiang, Zaid Harchaoui, Yejin Choi
NAACL 2024
-
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement
Linlu Qiu, Liwei Jiang, Ximing Lu, Melanie Sclar, Valentina Pyatkin, Chandra Bhagavatula, Bailin Wang, Yoon Kim, Yejin Choi, Nouha Dziri, Xiang Ren
ICLR 2024, Oral (1.2%)
-
THE GENERATIVE AI PARADOX: “What It Can Create, It May Not Understand”
*Peter West, *Ximing Lu, *Nouha Dziri, *Faeze Brahman, *Linjie Li, Jena D. Hwang, Liwei Jiang, Jillian Fisher, Abhilasha Ravichander, Khyathi Chandu, Benjamin Newman, Pang Wei Koh, Allyson Ettinger, Yejin Choi
ICLR 2024
-
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Bill Yuchen Lin, Abhilasha Ravichander, Ximing Lu, Nouha Dziri, Melanie Sclar, Khyathi Chandu, Chandra Bhagavatula, Yejin Choi
ICLR 2024
-
Tailoring Self-Rationalizers with Multi-Reward Distillation
Sahana Ramnath, Brihi Joshi, Skyler Hallinan, Ximing Lu, Liunian Harold Li, Aaron Chan, Jack Hessel, Yejin Choi, Xiang Ren
ICLR 2024
-
Improving Language Models with Advantage-Based Offline Policy Gradients
Ashutosh Baheti, Ximing Lu, Faeze Brahman, Ronan Le Bras, Maarten Sap, Mark Riedl
ICLR 2024
-
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
Taylor Sorensen, Liwei Jiang, Jena D Hwang, Sydney Levine, Valentina Pyatkin, Peter West, Nouha Dziri, Ximing Lu, Kavel Rao, Chandra Bhagavatula, Maarten Sap, John Tasioulas, Yejin Choi
AAAI 2024
-
Faith and Fate: Limits of Transformers on Compositionality
*Nouha Dziri, *Ximing Lu, *Melanie Sclar, +Xiang Lorraine Li, +Liwei Jiang, +Bill Yuchen Lin, Sean Welleck, Peter West, Chandra Bhagavatula, Ronan Le Bras, Jena Hwang, Soumya Sanyal, Xiang Ren, Allyson Ettinger, Zaid Harchaoui, Yejin Choi
NeurIPS 2023, Spotlight
-
Localized Symbolic Knowledge Distillation for Visual Commonsense Models
Jae Sung Park, Jack Hessel, Khyathi Chandu, Paul Pu Liang, Ximing Lu, Peter West, Youngjae Yu, Qiuyuan Huang, Jianfeng Gao, Ali Farhadi, Yejin Choi
NeurIPS 2023
-
Soda: Million-scale Dialogue Distillation with Social Commonsense Contextualization
Hyunwoo Kim, Jack Hessel, Liwei Jiang, Peter West, Ximing Lu, Youngjae Yu, Pei Zhou, Ronan Le Bras, Malihe Alikhani, Gunhee Kim, Maarten Sap, Yejin Choi
EMNLP 2023, Outstanding Paper Award
-
Inference-time Policy Adapters (IPA): Tailoring Extreme-Scale LMs Without Fine-Tuning
Ximing Lu, Faeze Brahman, Peter West, Jaehun Jung, Khyathi Chandu, Abhilasha Ravichander, Prithviraj Ammanabrolu, Liwei Jiang, Sahana Ramnath, Nouha Dziri, Jillian Fisher, Bill Lin, Skyler Hallinan, Lianhui Qin, Xiang Ren, Sean Welleck, Yejin Choi
EMNLP 2023
-
NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation
Peter West, Ronan Le Bras, Taylor Sorensen, Bill Yuchen Lin, Liwei Jiang, Ximing Lu, Khyathi Chandu, Jack Hessel, Ashutosh Baheti, Chandra Bhagavatula, Yejin Choi
Findings of EMNLP 2023
-
STEER: Unified Style Transfer with Expert Reinforcement
Skyler Hallinan, Faeze Brahman, Ximing Lu, Jaehun Jung, Sean Welleck, Yejin Choi
Findings of EMNLP 2023
-
In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search
Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Lorraine Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren
arXiv:2311.07237
-
ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations
Valentina Pyatkin, Jena D. Hwang, Vivek Srikumar, Ximing Lu, Liwei Jiang, Yejin Choi, Chandra Bhagavatula
ACL 2023
-
I2D2: Inductive Knowledge Distillation with Neurologic and Self-Imitation
Chandra Bhagavatula, Jena D. Hwang, Doug Downey, Ronan Le Bras, Ximing Lu, Keisuke Sakaguchi, Swabha Swayamdipta, Peter West, Yejin Choi
ACL 2023
-
Multimodal Knowledge Alignment with Reinforcement Learning
Youngjae Yu, Jiwan Chung, Heeseung Yun, Jack Hessel, Jae Sung Park, Ximing Lu, Rowan Zellers, Prithviraj Ammanabrolu, Ronan Le Bras, Gunhee Kim, Yejin Choi
CVPR 2023
-
Generating Sequences by Learning to Self-Correct
*Sean Welleck, *Ximing Lu, +Peter West, +Faeze Brahman, Tianxiao Shen, Daniel Khashabi, Yejin Choi
ICLR 2023
-
Quark: Controllable Text Generation with Reinforced Unlearning
Ximing Lu, Sean Welleck, Liwei Jiang, Jack Hessel, Lianhui Qin, Peter West, Prithviraj Ammanabrolu, Yejin Choi
NeurIPS 2022, Oral
-
Naturalprover: Grounded Mathematical Proof Generation with Language Models
Sean Welleck, Jiacheng Liu, Ximing Lu, Hannaneh Hajishirzi, Yejin Choi
NeurIPS 2022
-
Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering
Jiacheng Liu, Skyler Hallinan, Ximing Lu, Pengfei He, Sean Welleck, Hannaneh Hajishirzi, Yejin Choi
EMNLP 2022
-
Prosocialdialog: A Prosocial Backbone for Conversational Agents
Hyunwoo Kim, Youngjae Yu, Liwei Jiang, Ximing Lu, Daniel Khashabi, Gunhee Kim, Yejin Choi, Maarten Sap
EMNLP 2022
-
Twist Decoding: Diverse Generators Guide Each Other
Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Hao Peng, Ximing Lu, Dragomir Radev, Yejin Choi, Noah A. Smith
EMNLP 2022
-
MERLOT Reserve: Multimodal Neural Script Knowledge through Vision and Language and Sound
Rowan Zellers, Jiasen Lu, Ximing Lu, Youngjae Yu, Yanpeng Zhao, Mohammadreza Salehi, Aditya Kusupati, Jack Hessel, Ali Farhadi, Yejin Choi
CVPR 2022, Oral
-
Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer
Yanpeng Zhao, Jack Hessel, Youngjae Yu, Ximing Lu, Rowan Zellers, Yejin Choi
NAACL 2022, Oral
-
NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics
Ximing Lu, +Sean Welleck, +Peter West, Liwei Jiang, Jungo Kasai, Daniel Khashabi, Ronan Le Bras, Lianhui Qin, Youngjae Yu, Rowan Zellers, Noah Smith, Yejin Choi
NAACL 2022, Best Paper Award
-
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models
Peter West, Chandra Bhagavatula, Jack Hessel, Jena D. Hwang, Liwei Jiang, Ronan Le Bras, Ximing Lu, Sean Welleck, Yejin Choi
NAACL 2022, Oral
-
Generated Knowledge Prompting for Commonsense Reasoning
Jiacheng Liu, Alisa Liu, Ximing Lu, Sean Welleck, Peter West, Ronan Le Bras, Yejin Choi, Hannaneh Hajishirzi
ACL 2022
-
🍷MERLOT: Multimodal Neural Script Knowledge Models
*Rowan Zellers, *Ximing Lu, *Jack Hessel, Youngjae Yu, Jae Sung Park, Jize Cao, Ali Farhadi, Yejin Choi
NeurIPS 2021, Oral (1%)
-
Analyzing Commonsense Emergence in Few-shot Knowledge Models
Jeff Da, Ronan Le Bras, Ximing Lu, Yejin Choi, Antoine Bosselut
AKBC 2021
-
DExperts: On-the-Fly Controlled Text Generation with Experts and Anti-Experts
Alisa Liu, Maarten Sap, Ximing Lu, Swabha Swayamdipta, Chandra Bhagavatula, Noah Smith, Yejin Choi
ACL 2021, Oral
-
Reflective Decoding: Beyond Unidirectional Generation with Off-the-shelf Language Models
Peter West, Ximing Lu, Ari Holtzman, Chandra Bhagavatula, Jena D. Hwang, Yejin Choi
ACL 2021
-
On-the-Fly Attention Modulation for Neural Generation
Yue Dong, Chandra Bhagavatula, Ximing Lu, Jena D. Hwang, Antoine Bosselut, Jackie Chi Kit Cheung, Yejin Choi
ACL 2021 Findings
-
NeuroLogic Decoding: (Un)supervised Neural Text Generation with Predicate Logic Constraints
Ximing Lu, Peter West, Rowan Zellers, Ronan Le Bras, Chandra Bhagavatula, Yejin Choi
NAACL 2021
-
End-to-End Diagnosis of Breast Biopsy Images with Transformers
*Sachin Mehta, *Ximing Lu, Wenjun Wu, Donald Weaver, Hannaneh Hajishirzi, Joann Elmore, Linda Shapiro
Medical Image Analysis 79, 102466
-
Applications of the ESPNet Architecture in Medical Imaging
Sachin Mehta, Nicholas Nuechterlein, Ezgi Mercan, Beibin Li, Shima Nofallah, Wenjun Wu, Ximing Lu, Anat Caspi, Mohammad Rastegari, Joann Elmore, Hannaneh Hajishirzi, Linda Shapiro
State of the Art in Neural Networks and their Applications, 117-131
-
Analysis of Regions of Interest and Distractor Regions in Breast Biopsy Images
Ximing Lu, Sachin Mehta, Tad Brunyé, Donald Weaver, Joann Elmore, Linda Shapiro
BHI 2021
Honors & Awards
-
(2023) Outstanding Paper Award at EMNLP
-
(2023) Best Senior Thesis Award, Paul G. Allen School of Computer Science & Engineering
-
(2022) Best Paper Award at NAACL
-
(2020) Outstanding Undergraduate Researcher Award Runners-Up, Computing Research Association
-
(2020) Lisa Simonyi Prize, Paul G. Allen School of Computer Science & Engineering
-
(2020) Levinson Emerging Scholars Award, University of Washington
-
(2020) Mary Gates Research Scholarship, University of Washington
-
(2019) Denton, Denice Dee Scholars Endowment, Paul G. Allen School of Computer Science & Engineering
-
(2018) Second Prize of UW Datathon, Citadel Investment Group, LLC
-
(2018) Conference Travel Award, University of Washington
Teaching Experience
- (Winter, 2024) TA @ CSE 447/517 (Undergrad/Grad NLP) at University of Washington
- (Winter, 2021)-TA @ CSE P517 (Professional NLP) at University of Washington