Fuxiao Liu
Fuxiao Liu (刘赋骁)
I am a Research Scientist at NVIDIA. I obtained my Ph.D. from the University of Maryland, College Park in May 2025, under the supervision of Abhinav Shrivastava, Yaser Yacoob, Tianyi Zhou and Furong Huang.
My recent focus is on building customizable large models that follow human intent.
Google Scholar / LinkedIn / GitHub / Twitter / Instagram
Experience
- [Spring 2024] NVIDIA ADLR, with Guilin Liu and Zhiding Yu on building Large Multimodal Models: [ICLR'25].
- [Summer 2023] Tencent AI, with Xiaoyang Wang, Jianshu Chen, Kaiqiang Song, Wenlin Yao on Visual Chart Understanding: [NAACL'24].
- [Spring 2023] Microsoft Research, with Linjie Li, Kevin Lin, Jianfeng Wang on Robust Visual Instruction Tuning: [ICLR'24], [CVPR'24].
- [Summer 2022] Adobe Research, with Chris Tensmeyer, Hao Tan and Ani Nenkova on Visual Document Understanding: [ICPRAI'24].
- [Spring 2022] UMIACS, with Abhinav Shrivastava on Fact Checking on Short Video: [EACL'23].
- [Spring 2021] UVa Vislang Lab, with Vicente Ordonez on News Image Captioning: [EMNLP'21].
Selected Publications
- New! EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Fuxiao Liu*, Min Shi*, Shihao Wang, Shijia Liao, Subhashree Radhakrishnan, De-An Huang, Hongxu Yin, Karan Sapra, Yaser Yacoob, Humphrey Shi, Bryan Catanzaro, Andrew Tao, Jan Kautz, Zhiding Yu*, Guilin Liu*
- New! Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
Fuxiao Liu*, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, Lijuan Wang
- New! HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Fuxiao Liu*, Tianrui Guan*, Xiyang Wu, Ruiqi Xian, Xijun Wang, Zongxia Li, Lichang Chen, Yaser Yacoob, Dinesh Manocha, Tianyi Zhou
- New! MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning
Fuxiao Liu*, Xiaoyang Wang, Wenlin Yao, Jianshu Chen, Kaiqiang Song, Sangwoo Cho, Yaser Yacoob, Dong Yu
- DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents
Fuxiao Liu*, Hao Tan, Chris Tensmeyer
- COVID-VTS: Fact Extraction and Verification on Short Video Platforms
Fuxiao Liu*, Yaser Yacoob, Abhinav Shrivastava
- Visual News: Benchmark and Challenges in News Image Captioning
Fuxiao Liu*, Yinghan Wang, Tianlu Wang, Vicente Ordonez
Other Publications
- [NAACL 2025] Large language models and causal inference in collaboration: A comprehensive survey.
Xiaoyu Liu, Paiheng Xu, Junda Wu, Yuhang Zhou, Fuxiao Liu*, Tianrui Guan, Haoliang Wang, Tong Yu, Julian McAuley, Wei Ai, Furong Huang
- [NeurIPS 2024 Workshop] Towards understanding in-context learning with contrastive demonstrations and saliency maps.
Fuxiao Liu*, Paiheng Xu, Zongxia Li
- [LREC-COLING 2024] From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning, Efficiency and Beyond.
Hao Fei, Yuan Yao, Zhuosheng Zhang, Fuxiao Liu*, Ao Zhang, Tat-Seng Chua
- [ACL 2024] Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.
Xiyao Wang, Yuhang Zhou, Xiaoyu Liu, Hongjin Lu, Yuancheng Xu, Fuxiao Liu*, Mohit Bansal, Furong Huang
- [IROS 2025] On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities.
Xiyang Wu, Ruiqi Xian, Tianrui Guan, Jing Liang, Souradip Chakraborty, Fuxiao Liu*, Brian Sadler, Dinesh Manocha, Amrit Singh Bedi
- [IROS 2024] SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition.
Xijun Wang*, Ruiqi Xian, Tianrui Guan, Fuxiao Liu*, Dinesh Manocha
- [ICMLA 2024] DeepFM-CRISPR: Enhancing CRISPR On-Target Prediction with Deep Learning.
Condy Bao, Fuxiao Liu*
- [ACL 2025] Mosaic IT: Enhancing Instruction Tuning with Data Mosaics.
Ming Li, Pei Chen, Chenguang Wang, Hongyu Zhao, Yijun Liang, Yupeng Hou, Fuxiao Liu*, Tianyi Zhou
- [CVPR 2025 Workshop] A Survey of State of the Art Large Vision Language Models: Benchmark Evaluations and Challenges.
Zongxia Li, Xiyang Wu, Hongyang Du, Fuxiao Liu*, Huy Nghiem, Guangyao Shi
Service
- Conference Reviewer: AISTATS, CVPR, NAACL, ACL, IJCAI, ACMMM
- Journal Reviewer: JMIR
More About Myself
- I have been crazy about basketball since I was a little boy. I love it for the technical skill and mental toughness it demands.
No one in the world can become a master without great talent and extensive training.
My favorite basketball player is Kobe Bryant, who is noted for his rapid playing style, strong will, and ambivalent relationship with the sport.
I am always captivated by his phenomenal performances on the court.
School of Computer Science
University of Maryland, College Park