
Jonathan Zheng

I am a second-year PhD student in computer science at the Georgia Institute of Technology. I have been working with Wei Xu and Alan Ritter since my second undergraduate year.

My research interests are in Natural Language Processing and Machine Learning, focusing on large language model (LLM) learning and generalizability. I am also interested in applying LLMs for social good, such as misinformation detection and privacy risk detection.

Email / CV / GitHub / Google Scholar / Twitter

Research

I'm interested in the learning and reasoning process of large language models. My previous projects have explored the generalizability and robustness of NLP systems in diverse semantic spaces containing misinformation, language model representations of neologisms emerging over time, and probabilistic reasoning of LLMs in real-world applications.

Publications

Probabilistic Reasoning with LLMs for Privacy Risk Estimation
Jonathan Zheng, Sauvik Das, Alan Ritter, Wei Xu
NeurIPS, 2025
arXiv

Privacy Risk Estimation is a new probabilistic reasoning task that evaluates the capabilities of LLMs in using real-world statistics to estimate the identification risk of user-generated documents containing privacy-sensitive information.

NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms
Jonathan Zheng, Alan Ritter, Wei Xu
ACL, 2024
arXiv

Neo-Bench is a novel benchmark that evaluates the capabilities of LLMs in generalizing to new words that emerge over time.

Stanceosaurus 2.0: Classifying Stance Towards Russian and Spanish Misinformation
Anton Lavrouk, Ian Ligon, Tarek Naous, Jonathan Zheng, Alan Ritter, Wei Xu
W-NUT, 2024
arXiv

Stanceosaurus 2.0 extends the previous version with Russian and Spanish tweets annotated with stance towards claims, to help combat misinformation online, especially around the ongoing conflict in Ukraine.

Stanceosaurus: Classifying Stance Towards Multilingual Misinformation
Jonathan Zheng, Ashutosh Baheti, Tarek Naous, Wei Xu, Alan Ritter
EMNLP, 2022
arXiv

Stanceosaurus is a large corpus of English, Hindi, and Arabic tweets annotated with stance towards claims, to help combat misinformation online.
