| CARVIEW |
Yifan Gao
Senior Applied Scientist, Amazon Rufus
Palo Alto, CA
Gmail: yifangao95
Google Scholar
Biography
Yifan is a Senior Applied Scientist at Amazon Rufus team working on building large language models for shopping. His work covers developing LLM scaling laws and utilizing findings to determine key pre-training modeling factors, predict benchmark performance, and guide pre-training data recipe. This enabled the team to train billion-level LLMs from scratch, surpass state-of-the-art LLMs in shopping knowledge, and serve Amazon Rufus experience.
Yifan obtained his Ph.D. in Computer Science at the Chinese University of Hong Kong under the supervision of Prof. Irwin King and Prof. Michael R. Lyu.
Research Internship positions in Amazon are available. Feel free to send your CV to me if you are interested in LLM pre-training.
Research Interests
My research interests are in natural language processing and machine learning.
During my PhD study, I teach machines to ask and answer reading comprehension questions driven by two goals:
- Knowledge Assessment: Asking questions is an effective approach to assess the understanding of users towards a passage of text. I work on 1) how to ask questions with different levels of difficulty [IJCAI'19]; 2) how to ask interconnected questions in a conversation for interactiveness and persistence [ACL'19], and 3) how to generate distractors (wrong options) in multiple-choice reading comprehension questions [AAAI'19].
- Information Acquisition: Many questions are inherently underspecified or ambiguous in our day-to-day conversations. In this case, the machine needs to 1) keep asking follow-up questions in a dialog until it gathers enough information to answer those high-level questions [ACL'20] [EMNLP'20] [arXiv'21]; or 2) present a series of disambiguated QA pairs when the query is ambiguous[ACL'21].
Publications & Preprints
-
[ICML'25] M+: Extending MemoryLLM with Scalable Long-Term MemoryYu Wang, Dmitry Krotov, Yuanzhe Hu, Yifan Gao, Wangchunshu Zhou, Julian McAuley, Dan Gutfreund, Rogerio Feris, Zexue HeICML 2025.
-
[ACL'25] EcomScriptBench: A Multi-task Benchmark for E-commerce Script Planning via Step-wise Intention-Driven Product AssociationWeiqi Wang, Limeng Cui, Xin Liu, Sreyashi Nag, Wenju Xu, Chen Luo, Sheikh Muhammad Sarwar, Yang Li, Hansu Gu, Hui Liu, Changlong Yu, Jiaxin Bai, Yifan Gao, Haiyang Zhang, Qi He, Shuiwang Ji, Yangqiu SongACL 2025.
-
[ACL'25] UniConv: Unifying Retrieval and Response Generation for Large Language Model in ConversationFengran Mo, Yifan Gao, Chuan Meng, Xin Liu, Zhuofeng Wu, Kelong Mao, Zhengyang Wang, Pei Chen, Zheng Li, Xian Li, Bing Yin, Meng JiangACL 2025.
-
[ACL'25] Aligning Large Language Models with Implicit Preferences from User-Generated ContentZhaoxuan Tan, Zheng Li, Tianyi Liu, Haodong Wang, Hyokun Yun, Ming Zeng, Pei Chen, Zhihan Zhang, Yifan Gao, Ruijie Wang, Priyanka Nigam, Bing Yin, Meng JiangACL 2025.
-
[NAACL'25] IHEval: Evaluating Language Models on Following the Instruction HierarchyZhihan Zhang, Shiyang Li, Zixuan Zhang, Xin Liu, Haoming Jiang, Xianfeng Tang, Yifan Gao, Zheng Li, Haodong Wang, Zhaoxuan Tan, Yichuan Li, Qingyu Yin, Bing Yin, Meng JiangNAACL 2025.
-
[NAACL'25] Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-TrainingYuchen Zhuang, Jingfeng Yang, Haoming Jiang, Xin Liu, Kewei Cheng, Sanket Lokegaonkar, Yifan Gao, Qing Ping, Tianyi Liu, Binxuan Huang, Zheng Li, Zhengyang Wang, Pei Chen, Ruijie Wang, Rongzhi Zhang, Nasser Zalmout, Priyanka Nigam, Bing Yin, Chao ZhangNAACL 2025.
-
[NAACL'25] ALERT: An LLM-powered Benchmark for Automatic Evaluation of Recommendation ExplanationsYichuan Li, Xinyang Zhang, Chenwei Zhang, Mao Li, Tianyi Liu, Pei Chen, Yifan Gao, Kyumin Lee, Kaize Ding, Zhengyang Wang, Zhihan Zhang, Xian Li, Trishul ChilimbiNAACL 2025.
-
[TMLR] Scaling Laws for Predicting Downstream Performance in LLMsYangyi Chen, Binxuan Huang, Yifan Gao, Zhengyang Wang, Jingfeng Yang, Heng JiTransactions on Machine Learning Research
-
[ICML'24] MEMORYLLM: Toward Self-Updating Large Language ModelsYu Wang, Yifan Gao, Xiusi Chen, Haoming Jiang, Shiyang Li, Jingfeng Yang, Qingyu Yin, Zheng Li, Xian Li, Bing Yin, Jingbo Shang, Julian McAuley2024 International Conference on Machine Learning (ICML'24).
-
[ArXiv'24] Inductive or deductive? rethinking the fundamental reasoning abilities of llmsKewei Cheng, Jingfeng Yang, Haoming Jiang, Zhengyang Wang, Binxuan Huang, Ruirui Li, Shiyang Li, Zheng Li, Yifan Gao, Xian Li, Bing Yin, Yizhou Sunarxiv.
-
[EMNLP'24] Large Language Models in Healthcare: A Comprehensive BenchmarkFenglin Liu, Zheng Li, Hongjian Zhou, Qingyu Yin, Jingfeng Yang, Xianfeng Tang, Chen Luo, Ming Zeng, Haoming Jiang, Yifan Gao, Priyanka Nigam, Sreyashi Nag, Bing Yin, Yining Hua, Xuan Zhou, Omid Rohanian, Anshul Thakur, Lei Clifton, David A. CliftonEMNLP 2024.
-
[NeurIPS'24 Benchmark] Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language ModelsYilun Jin, Zheng Li, Chenwei Zhang, Tianyu Cao, Yifan Gao, Pratik Jayarao, Mao Li, Xin Liu, Ritesh Sarkhel, Xianfeng Tang, Haodong Wang, Zhengyang Wang, Wenju Xu, Jingfeng Yang, Qingyu Yin, Xian Li, Priyanka Nigam, Yi Xu, Kai Chen, Qiang Yang, Meng Jiang, Bing YinNeurIPS 2024 Datasets and Benchmarks Track.
-
[SIGMOD'24] COSMO: A Large-Scale E-commerce Common Sense Knowledge Generation and Serving System at AmazonChanglong Yu, Xin Liu, Jefferson Maia, Tianyu Cao, Yang Li, Yifan Gao, Yangqiu Song, Rahul Goutam, Haiyang Zhang, Bing Yin, Zheng Li2024 Symposium on Principles of Database Systems (SIGMOD'24).
-
[NeurIPS'23] Enhancing User Intent Capture in Session-Based Recommendation with Attribute PatternsXin Liu, Zheng Li, Yifan Gao, Jingfeng Yang, Tianyu Cao, Zhengyang Wang, Bing Yin, Yangqiu Song2023 Conference on Neural Information Processing Systems (NeurIPS'23).
-
[EMNLP'23-Findings] Improving Consistency for Text Summarization with Energy FunctionsQi Zeng, Qingyu Yin, Zheng Li, Yifan Gao, Sreyashi Nag, Zhengyang Wang, Bing Yin, Heng Ji, Chao ZhangThe 2023 Conference on Empirical Methods in Natural Language Processing, Findings (EMNLP'23-Findings).
-
[EMNLP'23-Findings] Knowledge-Selective Pretraining for Attribute Value ExtractionHui Liu, Qingyu Yin, Zhengyang Wang, Chenwei Zhang, Haoming Jiang, Yifan Gao, Zheng Li, Xian Li, Chao Zhang, Bing Yin, William Yang Wang, Xiaodan ZhuThe 2023 Conference on Empirical Methods in Natural Language Processing, Findings (EMNLP'23-Findings).
-
[arXiv'23] Situated Natural Language ExplanationsZining Zhu, Haoming Jiang, Jingfeng Yang, Sreyashi Nag, Chao Zhang, Jie Huang, Yifan Gao, Frank Rudzicz, Bing Yin
-
[arXiv'23] CCGen: Explainable Complementary Concept Generation in E-CommerceJie Huang, Yifan Gao, Zheng Li, Jingfeng Yang, Yangqiu Song, Chao Zhang, Zining Zhu, Haoming Jiang, Kevin Chen-Chuan Chang, Bing Yin
-
[ACL'23] Self-Consistent Chain-of-Thought DistillationPeifeng Wang, Zhengyang Wang, Zheng Li, Yifan Gao, Bing Yin and Xiang RenThe 61st Annual Meeting of the Association for Computational Linguistics (ACL’23) Outstanding Paper Award.
-
[ACL'23-Findings] Graph Reasoning for Question Answering with Triplet RetrievalShiyang Li, Yifan Gao, Haoming Jiang, Qingyu Yin, Zheng Li, Xifeng Yan, Chao Zhang and Bing YinThe 61st Annual Meeting of the Association for Computational Linguistics, Findings (ACL’23-Findings).
-
[ACL'23-Findings] FolkScope: Intention Knowledge Graph Construction for Discovering E-commerce CommonsenseChanglong Yu, Weiqi Wang, Xin Liu, Jiaxin Bai, Yangqiu Song, Zheng Li, Yifan Gao, Tianyu Cao, Bing YinThe 61st Annual Meeting of the Association for Computational Linguistics, Findings (ACL’23-Findings).
-
[AI Open'23] Improving Task Generalization via Unified Schema PromptWanjun Zhong, Yifan Gao, Ning Ding, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan DuanAI Open, 2023.
-
[RecSys'22] Query Attribute Recommendation at Amazon SearchChen Luo, William Headean, Neela Avudaiappan, Haoming Jiang, Tianyu Cao, Qingyu Yin, Yifan Gao, Zheng Li, Rahul Goutam, Haiyang Zhang, Bing YinThe 16th ACM Recommender Systems Conference, 2022 (RecSys 2022, Industry Talk Proposal).
-
[NAACL'22] ProQA: Structural Prompt-based Pre-training for Unified Question AnsweringWanjun Zhong*, Yifan Gao*, Ning Ding, Yujia Qin, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan (* indicates equal contribution)The 2022 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2022, Long Paper).
-
[NAACL'22-Findings] Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative TrainingYifan Gao, Qingyu Yin, zheng li, Rui Meng, Tong Zhao, Bing Yin, Irwin King, Michael R. LyuFindings of the Association for Computational Linguistics: NAACL 2022, Long Paper.
-
[PhD Thesis] Teaching Machines to Ask and Answer Questions: Knowledge Assessment and Information Acquisition in Reading Comprehension
-
[arXiv'21] Open-Retrieval Conversational Machine ReadingYifan Gao, Jingjing Li, Chien-Sheng Wu, Michael R. Lyu, Irwin King[arxiv] [paper] [code & data]
-
[ACL'21] Answering Ambiguous Questions through Generative Evidence Fusion and Round-Trip PredictionYifan Gao, Henghui Zhu, Patrick Ng, Cicero Nogueira dos Santos, Zhiguo Wang, Feng Nan, Dejiao Zhang, Ramesh Nallapati, Andrew O. Arnold, Bing XiangThe 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021, Long Paper, Oral).
-
[EMNLP'20] Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine ReadingYifan Gao, Chien-Sheng Wu, Jingjing Li, Shafiq Joty, Steven C.H. Hoi, Caiming Xiong, Irwin King, Michael R. LyuThe 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020, Long Paper, Oral).
-
[EMNLP'20-Findings] Dialogue Generation on Infrequent Sentence Functions via Structured Meta-LearningYifan Gao, Piji Li, Wei Bi, Xiaojiang Liu, Michael R. Lyu, Irwin KingFindings of the Association for Computational Linguistics: EMNLP 2020, Long Paper.
Presented at The First Workshop on Computational Approaches to Discourse (CODI 2020). -
[ACL'20] Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine ReadingYifan Gao, Chien-Sheng Wu, Shafiq Joty, Caiming Xiong, Richard Socher, Irwin King, Michael R. Lyu, Steven C.H. HoiThe 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020, Long Paper).
-
[EMNLP'19] Improving Question Generation With to the Point ContextJingjing Li*, Yifan Gao*, Lidong Bing, Irwin King, Michael R. Lyu (* indicates equal contribution)The 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019, Long Paper, Poster).
-
[ACL'19] Interconnected Question Generation with Coreference Alignment and Conversation Flow ModelingYifan Gao, Piji Li, Irwin King, Michael R. LyuThe 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019, Long Paper, Oral).
-
[IJCAI'19] Difficulty Controllable Question Generation for Reading ComprehensionYifan Gao, Lidong Bing, Wang Chen, Michael R. Lyu, Irwin KingThe 28th International Joint Conference on Artificial Intelligence (IJCAI 2019).
-
[AAAI'19] Generating Distractors for Reading Comprehension Questions from Real ExaminationsYifan Gao, Lidong Bing, Piji Li, Irwin King, Michael R. LyuThe Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019).
-
[AAAI'19] Title-Guided Encoding for Keyphrase GenerationWang Chen, Yifan Gao, Jiani Zhang, Irwin King, Michael R. LyuThe Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019).
Awards & Honors
- ACL 2023 Outstanding Paper Award, 2023
- Global Top 100 Chinese Rising Stars in Artificial Intelligence, by Baidu Scholar, 2021. [Related News (Chinese)] [Related News] [Related News]
Academic Service
- Conference Area Chair:
- ACL/EMNLP/NAACL: 2024
- AMLC: 2024
- Journal Reviewer:
- ACM Transactions on Intelligent Systems and Technology (TIST)
- Neurocomputing
- Neural Networks (NN)
- Natural Language Engineering
- IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP)
- Conference PC Member:
- ACL: 2020, 2021, 2022, 2023
- EMNLP: 2020, 2021
- NAACL: 2021, 2022
- ICML: 2020, 2021, 2022, 2023
- NeurIPS: 2021, 2022, 2023
- ICLR: 2021, 2022, 2023
- AAAI: 2019, 2020, 2021, 2022, 2023
- IJCAI: 2021,2022
Intern Mentorship
- Jie Huang, PhD@UIUC, 2022 Summer
- Shiyang Li, PhD@UCSB, 2022 Fall
- Yu Wang, PhD@UCSD, 2023 Summer
- Jiacheng Li, PhD@UCSD, 2023 Fall
- Fengran Mo, PhD@Université de Montréal, 2024 Summer
- Scott Chang, PhD@TAMU, 2024 Fall
- Jacky Chan, PhD@HKUST, 2025 Spring