| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Fri, 24 Oct 2025 03:44:18 GMT
access-control-allow-origin: *
etag: W/"68faf612-e2b0"
expires: Sun, 28 Dec 2025 06:13:53 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 320F:123DE:743C1F:824596:6950C846
accept-ranges: bytes
age: 0
date: Sun, 28 Dec 2025 06:03:53 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210054-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766901833.895534,VS0,VE204
vary: Accept-Encoding
x-fastly-request-id: e36e71e5530a5e4a7e27a5fe51f89a13139d9468
content-length: 8921
Shuang Li

Selected Publications
[Full List]
(
show by date /
show by topic)
Shuang Li
Senior Research Scientist
Google DeepMind
Email: shuangligoogle [at] google (dot) com
Google Scholar /
Twitter
I am a Senior Research Scientist at Google DeepMind. Previously, I was a founding member of Voyage AI, a language-embedding startup acquired by MongoDB. I did my postdoc at Stanford with Shuran Song and Dorsa Sadigh, and completed my Ph.D. at MIT advised by Antonio Torralba. My research focus on world modeling and robot learning.
Recent Talks
- [2025/5] Keynote talk at CVPR Workshop Foundation Models Meet Embodied Agents 2025: Vision and Language Models for Decision-Making
- [2025/3] Invited talk at AAAI 2025: How Vision and Language Models Are Changing Decision-Making
- [2023/12] Invited Talk at NeurIPS 2023 Workshop on Diffusion Models
- [2023/8] Invited Talk at Google
- [2023/8] Diffusion Models Course at SIGGRAPH
- [2023/5] Invited Talk at UC Berkeley
- [2023/4] Invited Talk at Nanyang Technological University
- [2023/2] Invited Talk at Columbia
- [2023/2] Invited Talk at UIUC
- [2023/2] Invited Talk at UT Austin
- [2022/12] Invited Talk at UC San Diego
- [2022/11] Invited Talk at Cornell Tech
- [2022/11] Invited Talk at UC Berkeley
- [2022/11] Invited Talk at Stanford
- [2022/11] Invited Talk at Northeastern
Datasets and Environments
- VirtualHome multi-agent embodied environment for household activities
- V-HICO dataset for video-based human-object interaction detection
- CUHK-SYSU dataset for person re-identification
- CUHK-PEDES dataset for natural language based person re-identification
Selected Publications
[Full List]
(
show by date /
show by topic)
Research Topics:
Generative Modeling /
Robot Learning
Selected Honors
Workshops / Tutorials Organizer
- Visual Generative Modeling: What’s After Diffusion? at CVPR 2025
- Continual Robot Learning from Humans at RSS 2025
- Knowledge in Generative Models at ECCV 2024
- Diffusion Models Course at SIGGRAPH 2023
- Social Intelligence in Humans and Robots at ICRA 2021, RSS 2022, RSS 2023
-
MIT Visual Computing Workshop 2021
- Wider Face and Person Challenge at ICCV 2019