CARVIEW

MOTORHOMES

Select Language

HTTP/2 200 server: GitHub.com content-type: text/html; charset=utf-8 last-modified: Wed, 03 Dec 2025 18:08:08 GMT access-control-allow-origin: * strict-transport-security: max-age=31556952 etag: W/"69307c88-22a8" expires: Mon, 29 Dec 2025 19:49:12 GMT cache-control: max-age=600 content-encoding: gzip x-proxy-cache: MISS x-github-request-id: 36DC:2F7ECD:942D72:A62F20:6952D8E0 accept-ranges: bytes age: 0 date: Mon, 29 Dec 2025 19:39:12 GMT via: 1.1 varnish x-served-by: cache-bom-vanm7210070-BOM x-cache: MISS x-cache-hits: 0 x-timer: S1767037153.544395,VS0,VE203 vary: Accept-Encoding x-fastly-request-id: b51d585f284c323db17f0727d7595408e8db0c19 content-length: 3284 Siyi Liu - University of Pennsylvania

Siyi Liu

siyiliu@seas.upenn.edu
3401 Walnut St, Philadelphia, PA

Google Scholar
Curriculum Vitae

About

I'm final-year PhD student at Department of Computer and Information Science, University of Pennsylvania, advised by Prof. Dan Roth

My research spans natural language processing and broader artificial intelligence. Recently, I’m particularly interested in studying different types of conflicts that emerge in modern AI systems, including:

hallucinations, factuality and alignment - model generations that are non-factual, toxic, or unsubstatiated/contradictory to the contexts [2][3]
retrieval, knowledge conflicts, and reasoning - How can LLMs reason through conflicting knowledge from different sources (contextual/parametric knowledge)? [1][4]
perspectives and biases - conflicts across different perspectives and opinions [7] [8] [9]

Some of my other ongoing projects include:

Teaching LLMs to learn from failures/feedback
Deepfake Video Detection
Representation Learning for Authorship Representation

Some of my other past projects include:

Open Source Project (Core Contributor) - Distilled Generative Text Embedding Model
Framing Bias Detection

Selected Publications

[1] ConflictScore: Measuring How Language Models Handle Conflicting Evidence
Siyi Liu, Patrick Xia, et al.
In submission

[2] DeeptraceReward: Learning Human-Perceived Fakeness in Generated Videos with Multimodal LLMs
Xingyu Fu, Siyi Liu, et al.
Neurips GenProCC Workshop 2025

[3] Towards Long Context Hallucination Detection
Siyi Liu , Kishaloy Halder, et al.
NAACL 2025 Findings

[4] Open Domain Question Answering with Conflicting Contexts
Siyi Liu , Qiang Ning, et al.
NAACL 2025 Findings

[5] Using LLM for improving key event discovery: Temporal-guided news stream clustering with event summaries
Nishanth Nakshatri, Siyi Liu, Sihao Chen, Daniel Hopkins, Dan Roth, Dan Goldwasser
EMNLP 2023

[6] Open-Domain Event Graph Induction for Mitigating Framing Bias
Siyi Liu, Hongming Zhang, Hongwei Wang, Kaiqiang Song, Dan Roth, Dong Yu
arXiv

[7] Design Challenges for a Multi-Perspective Search Engine
Sihao Chen*, Siyi Liu* , Xander Uyttendaele, Yi Zhang, William Bruno, Dan Roth
NAACL 2022 Findings

[8] MultiOpEd: A Corpus of Multi-Perspective News Editorials
Siyi Liu, Sihao Chen, Xander Uyttendaele, Dan Roth
NAACL 2021
[Code] [Slides] [Poster] [Talk]

[9] Detecting frames in news headlines and its application to analyzing news framing trends surrounding US gun violence
Siyi Liu, Lei Guo, Kate Mays, Margrit Betke, Derry Tanti Wijaya
CoNLL 2019

[10] Learning to mirror speaking styles incrementally
Siyi Liu*, Ziang Leng*, Derry Wijaya
arXiv