| CARVIEW |
Select Language
HTTP/2 301
server: GitHub.com
content-type: text/html
location: https://hkchengrex.com/
x-github-request-id: D58F:444BC:7C883D:8B864E:695147BB
accept-ranges: bytes
age: 0
date: Sun, 28 Dec 2025 15:07:40 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210067-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766934460.486086,VS0,VE196
vary: Accept-Encoding
x-fastly-request-id: 43a399c15f1fe9cef727c039016f7a6e6bda071c
content-length: 162
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Mon, 22 Dec 2025 10:13:23 GMT
access-control-allow-origin: *
etag: W/"694919c3-595c"
expires: Sun, 28 Dec 2025 15:17:40 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 2DC5:234FE9:7CF5E0:8BF516:695147BC
accept-ranges: bytes
age: 0
date: Sun, 28 Dec 2025 15:07:40 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210047-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766934461.784871,VS0,VE201
vary: Accept-Encoding
x-fastly-request-id: 97d3ea31031408750acd715fc30d99e0a2c733c2
content-length: 5234
Rex Cheng
Ho Kei (Rex) Cheng
I am a Ph.D. candidate at the University of Illinois Urbana-Champaign, advised by
I work on visual understanding, with a focus on videos. I have interned at Adobe Research, Kaiber, Sony AI, and FAIR/Meta MSL PAR.
Research (hover over videos to play)
Nicolas Carion, Laura Gustafson, Yuan-Ting Hu, Shoubhik Debnath, Ronghang Hu, Didac Suris, Chaitanya Ryali, Kalyan Vasudev Alwala, Haitham Khedr, Andrew Huang, Jie Lei, Tengyu Ma, Baishan Guo, Arpit Kalla, Markus Marks, Joseph Greer, Meng Wang, Peize Sun, Roman Rädle, Triantafyllos Afouras, Effrosyni Mavroudi, Katherine Xu, Tsung-Han Wu, Yu Zhou, Liliane Momeni, Rishi Hazra, Shuangrui Ding, Sagar Vaze, Francois Porcher, Feng Li, Siyuan Li, Aishwarya Kamath, Ho Kei Cheng, Piotr Dollár, Nikhila Ravi, Kate Saenko, Pengchuan Zhang, Christoph Feichtenhofer.
arXiv 2025
A unified model for detection, segmentation, and tracking of objects in images and video using text, exemplar, and visual prompts.
Ho Kei Cheng,
Alexander Schwing.
ICCV 2025
Provides straighter flows through condition-aware coupling of samples from the prior and data distributions, without the test-time degradation induced by naïve optimal transport.
CVPR 2025
Generates high-quality synchronized audio from video or text inputs, with an architecture that enables training on data from multiple sources even when some modalities are missing.
Click here to watch a fun video!
CVPR 2024 Highlight
ICCV 2023
Achieves open-world video segmentation by combining universal image segmentation with temporal propagation. Easy to extend.
Ho Kei Cheng,
Alexander Schwing.
ECCV 2022
Approaches video object segmentation from a memory perspective with a pipeline that effectively models both short-term and long-term dependencies.
Used by supervisely and Track-Anything.
Ho Kei Cheng,
Yu-Wing Tai,
Chi Keung Tang.
CVPR 2021
Decouples interactive video segmentation into two components: single-frame interaction and temporal propagation, demonstrating significantly improved performance.
Used by Sieve.
CVPR 2020
An iterative refinement network that achieves high-quality 4K+ segmentation using only low-resolution training data (less than 500 pixels per side).
Invited Talks
- Object-Level Reasoning in Video Object Segmentation and Its Multimodal Applications @ Twelve Labs, September 2024
- Segmenting Videos in the Open World @ IBM Zurich, Accelerated Discovery, September 2023
- Large-Scale Decoupled Video Segmentation @ Apple, September 2023
Tools
nitrous-ema
Low overhead post-hoc EMA for PyTorch.
vos-benchmark
Fast and simple benchmarking for video object segmentation.
shared-memory-tensor-dataset
A simple demo for sharing an in-memory dataset among DDP processes.
Professional Activities
- Reviewed for: CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML, AAAI, IEEE TIP, IEEE PR, IEEE TPAMI, IEEE TCSVT.
- Outstanding reviewer in ICML 2022.
Misc
- I was a proud member of the HKUST Robotics Team. A short clip.
-
I am generally interested in artificial intelligence. I believe in AGIs and has high hope for their potential to transform human civilization for the better.
- "Man is condemned to be free. Condemned, because he did not create himself, in other respect is free; because, once thrown into the world, he is responsible for everything he does."
- Look at this cat in HKUST. Another picture. Or this cat.
- "Ho Kei" (with the space) is my first name and "Cheng" is my last name. "Rex" is the commonly used "english name" that is not part of my legal name. This is common in Hong Kong.