| CARVIEW |
Originally from Beijing, I am a final-year Ph.D. student at MIT CSAIL, advised by Yoon Kim. My research in NLP focuses on training robust, generalizable language models and developing rigorous frameworks to evaluate them. Previously, I earned my M.S. in Computer Science, B.S. in Computer Science, and B.A. in Linguistics from the University of Washington.
My work sits at the intersection of language and computation. I have conducted research at leading labs including Google, Meta, the Allen Institute for Artificial Intelligence (AI2) (as a PYI), where I worked on problems ranging from academic research to large-scale model training.
With Alexis and Shannon, I created cs-sop.org, a platform where past CS PhD program applicants generously share their statements of purpose and advice. We hope this could be a useful resource for future applicants to navigate the application process.
💼 I am on the 2025-2026 industry job market.
Publications
* = Equal Contribution
-
Zhaofeng Wu, Michihiro Yasunaga, Andrew Cohen, Yoon Kim, Asli Celikyilmaz, and Marjan GhazvininejadIn Empirical Methods in Natural Language Processing (EMNLP), 2025.
-
Yung-Sung Chuang, Benjamin Cohen-Wang, Shannon Zejiang Shen, Zhaofeng Wu, Hu Xu, Xi Victoria Lin, James Glass, Shang-Wen Li, and Wen-tau YihIn International Conference on Machine Learning (ICML), 2025.
-
Zhaofeng Wu, Xinyan Velocity Yu, Dani Yogatama, Jiasen Lu, and Yoon KimIn International Conference on Learning Representations (ICLR), 2025.
-
Yihong Tang, Ao Qu, Zhaokai Wang, Dingyi Zhuang, Zhaofeng Wu, Wei Ma, Shenhao Wang, Yunhan Zheng, Zhan Zhao, and Jinhua ZhaoIn Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP Findings), 2025.
-
Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, and Ahmad BeiramiIn Empirical Methods in Natural Language Processing (EMNLP), 2024.
-
William Merrill*, Zhaofeng Wu*, Norihito Naka, Yoon Kim, and Tal LinzenIn Findings of the Annual Meeting of the Association for Computational Linguistics (ACL Findings), 2024.
-
Zhaofeng Wu, Linlu Qiu, Alexis Ross, Ekin Akyürek, Boyuan Chen, Bailin Wang, Najoung Kim, Jacob Andreas, and Yoon KimIn North American Chapter of the Association for Computational Linguistics (NAACL), 2024.[slides]
-
Yihong Tang, Zhaokai Wang, Ao Qu, Yihao Yan, Zhaofeng Wu, Dingyi Zhuang, Jushi Kai, Kebing Hou, Xiaotong Guo, Jinhua Zhao, Zhan Zhao, and Wei MaIn Empirical Methods in Natural Language Processing (EMNLP): Industry Track, 2024.
-
Zhaofeng Wu, William Merrill, Hao Peng, Iz Beltagy, and Noah A. SmithIn Transactions of the Association for Computational Linguistics (TACL), 2023.[slides] [poster]
-
Alisa Liu, Zhaofeng Wu, Julian Michael, Alane Suhr, Peter West, Alexander Koller, Swabha Swayamdipta, Noah A. Smith, and Yejin ChoiIn Empirical Methods in Natural Language Processing (EMNLP), 2023.
-
Zhaofeng Wu, Robert L. Logan IV, Pete Walsh, Akshita Bhagia, Dirk Groeneveld, Sameer Singh, and Iz BeltagyIn Empirical Methods in Natural Language Processing (EMNLP), 2022.[poster]
-
Zhaofeng Wu, Hao Peng, Nikolaos Pappas, and Noah A. SmithIn Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP Findings), 2022.[poster]
-
Hao Peng, Jungo Kasai, Nikolaos Pappas, Dani Yogatama, Zhaofeng Wu, Lingpeng Kong, Roy Schwartz, and Noah A. SmithIn Annual Meeting of the Association for Computational Linguistics (ACL), 2022.
-
Zhaofeng WuUnpublished manuscript, 2022.
-
Zhaofeng Wu, Hao Peng, and Noah A. SmithIn Transactions of the Association for Computational Linguistics (TACL), 2021.[slides]
-
Zhaofeng Wu and Matt GardnerIn Workshop on Computational Models of Reference, Anaphora and Coreference @ EMNLP, 2021.[slides]
-
Zhaofeng Wu, Ding Zhao, Qiao Liang, Jiahui Yu, Anmol Gulati, and Ruoming PangIn IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021.[slides]
-
Zhaofeng Wu, Yan Song, Sicong Huang, Yuanhe Tian, and Fei XiaIn BioNLP Workshop and Shared Task @ ACL, 2019.
Talks
Data-General Computation in Language Models
Stanford, University of Chicago, UIUC
April – July 2025
The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities
USC, MIT
November 2024 – April 2025
Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment
Brown
April 2024
Generalization in the LLM Era
University of Utah, University of Virginia CS 6501
March 2024
Language Models: A Reality Check
Princeton, Cornell, NYU, UIUC CS 598, Google
September – November 2023
Graph Neural Networks for NLP
UW CSE 481N
May 2021