| CARVIEW |
I am a Senior Machine Learning Scientist on the Foundation Models team at Prescient Design, Genentech (Roche).
My work involves leading the development of agentic automation and intelligent platforms for molecular drug discovery and contributing to the training of scientific large language models.
I hold a PhD in Computer Science from UCLA, advised by Prof. Wei Wang. I’m an award recipient of the J.P. Morgan Chase AI PhD Fellowship and the Amazon Fellowship. My prior research experience includes work at Amazon AGI, USC, CUHK, UC Santa Cruz, MIT and PolyU.
I develop machine learning (ML) systems inspired by scientific data and expert tasks, equipping large language models (LLMs) with the intuition and knowledge of domain experts. My research introduces machine learning innovations and insights to enable a comprehensive spectrum of expertise acquisition, from explicit to implicit knowledge and from individual decision-making to the automation of complex expert workflows. Specifically, I focus on:
Extracting explicit knowledge from unstructured data in low-resource scenarios: dataset (ACL'23), library (NAACL'21), data-efficient (ACL'23) and parameter-efficient (INTERSPEECH'23) methods, indirect supervision (ACL'23, EMNLP-F'22), cross-document (ACL-F'23), data synthesis/augmentation for zero-shot scenarios (AAAI'24, ACL'24)
Capturing implicit expert intuition: LLMs’ clinical decision-making benchmark (preprint 24), rich supervision to model decision sequences (AAAI'25), conveying intuition with a decoding-free paradigm (ACL'25)
Compositional, project-level reasoning and automation: KG-inspired reasoning (ICML'25), cross-modality (NeurIPS'24), drug discovery agents, scientific workflow agent platform (preprint 25), material design (NAACL'25)
Fairness and safety of generative LLMs: unsupervised bias mitigation (NAACL'24), attacking LLM with data poisoning (NAACL'24), ownership protection (NAACL'24), LLMs’ clinical bias analysis (preprint 24)
Empowered expert applications: clinical diagnosis (preprint 24, ACL'25, preprint 24), health outcome prediction (AAAI'25), clinical event extraction (ACL'23), biomedical and scientific QAs (ICML'25, NeurIPS'24, NAACL'24, ACL'23), computational social science (EMNLP'24, AAAI'24), political event forecasting (preprint 24), dialogue state tracking (INTERSPEECH'23), knowledge structure/graph construction (EMNLP-F'21, AKBC'22)
Inferring from Logits: Exploring Best Practices for Decoding-Free Generative Candidate Selection
GIVE: Structured Reasoning with Knowledge Graph Inspired Veracity Extrapolation
Orchestrating Tool Ecosystem of Drug Discovery with Intention-Aware LLM Agents
SpatialAgent: An Autonomous AI Agent for Spatial Biology
BIASINSPECTOR: Detecting Bias in Structured Data through LLM Agents
Entropy-Based Adaptive Weighting for Self-Training
Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction
How Californians Tweet about Extreme Heat Events on Social Media: A Health Equity Perspective
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
CliBench: A Multifaceted and Multigranular Evaluation of Large Language Models for Clinical Decision Making
GraphVis: Boosting LLMs with Visual Knowledge Graph Integration
Decoding Susceptibility: Modeling Misbelief to Misinformation Through a Computational Approach
Are Large-Language Models Graph Algorithmic Reasoners?
CLIMB: A Benchmark of Clinical Bias in Large Language Models
MIRAI: Evaluating LLM Agents for Event Forecasting
Improving Event Definition Following For Zero-Shot Event Detection
Mitigating Bias for Question Answering Models by Tracking Bias Influence
Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models
Instructional Fingerprinting of Large Language Models
STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models
MIDDAG: Where Does Our News Go? Investigating Information Diffusion via Community-Level Information Pathways
DICE: Data-Efficient Clinical Event Extraction with Generative Models
Can NLI Provide Proper Indirect Supervision for Low-resource Biomedical Relation Extraction?
Multi-hop Evidence Retrieval for Cross-document Relation Extraction
Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning
Summarization as Indirect Supervision for Relation Extraction
Bending the Future: Autoregressive Modeling of Temporal Knowledge Graphs in Curvature-Variable Hyperbolic Spaces
HyperExpan: Taxonomy Expansion with Hyperbolic Representation Learning
EventPlus: A Temporal Event Understanding Pipeline
Dual Memory Network Model for Sentiment Analysis of Review Text
Implicit Discourse Relation Identification for Open-domain Dialogues
Dual Memory Network Model for Biased Product Review Classification
BlocHIE: a BLOCkchain-based platform for Healthcare Information Exchange
Experience

Prescient Design, Genentech (Roche)
Senior Machine Learning Scientist
Since 2024; New York, NY
Amazon Alexa AI
Applied Scientist Intern
2021 & 2022; Sunnyvale, CA
Education

University of California, Los Angeles

The Hong Kong Polytechnic University
Bachelor of Science in Computing (First Class Honours)
2014 - 2018; Hong Kong
Best Thesis Award, commencement speaker, advised by Prof. Qin Lu and Prof. Jiannong Cao
University of Maryland, College Park
Exchange Student
2016; College Park, MD
Awards
Teaching
Services
Lead Organizer Organizer: Area Chair:- ACL (2025), EMNLP (2025)
- ACL Rolling Review (2025, 2024, 2023, 2022, 2021), ICLR (2025), ACL (2024, 2023, 2022), EMNLP (2024, 2023, 2022, 2021), NAACL (2025, 2024, 2022), COLM (2024), KDD (2023), EACL (2023), NLPCC (2023, 2022)
- IEEE/ACM Transactions on Audio, Speech, and Language Processing (since 2023)
- NeurIPS workshop on Efficient Natural Language and Speech Processing (2024)
- AAAI Spring Symposium on Clinical Foundation Models (2024)
- EMNLP workshop on Deep Learning for Low-resources NLP workshop (2019)
- SoCal NLP Symposium (2023, 2022)