| CARVIEW |
Select Language
HTTP/2 301
server: GitHub.com
content-type: text/html
location: https://wusize.github.io/publications/
access-control-allow-origin: *
strict-transport-security: max-age=31556952
expires: Wed, 31 Dec 2025 00:41:57 GMT
cache-control: max-age=600
x-proxy-cache: MISS
x-github-request-id: 608C:2916CC:AA18BA:BF2A67:69546EFD
accept-ranges: bytes
age: 0
date: Wed, 31 Dec 2025 00:31:57 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210032-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767141118.538596,VS0,VE221
vary: Accept-Encoding
x-fastly-request-id: 7afd821a90ff09808ae27bb499f8d53708c9433e
content-length: 162
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Wed, 25 Jun 2025 23:56:45 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"685c8cbd-2fb0"
expires: Wed, 31 Dec 2025 00:41:57 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: DE28:2DDCFF:A91C72:BE2AC3:69546EFD
accept-ranges: bytes
age: 0
date: Wed, 31 Dec 2025 00:31:57 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210032-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767141118.776965,VS0,VE214
vary: Accept-Encoding
x-fastly-request-id: 8cfda9a7a11e687dd21a862cc30e195239a8aba2
content-length: 3332
Papers - Size Wu
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Zhonghua Wu, Qingyi Tao, Wentao Liu, Wei Li and Chen Change Loy
International Conference on Computer Vision (ICCV) , 2025
[Paper] [Code]
F-LMM: Grounding Frozen Large Multimodal Models
Size Wu, Sheng Jin, Wenwei Zhang, Lumin Xu, Wentao Liu, Wei Li and Chen Change Loy
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2025
[Paper] [Code]
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Xiangtai Li, Wentao Liu and Chen Change Loy
International Conference on Learning Representations (ICLR) , 2024
Spotlight (top 5%)
[Paper] [Code]
CLIM: Contrastive Language-Image Mosaic for Region Representation
Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Wentao Liu and Chen Change Loy
Association for the Advancement of Artificial Intelligence (AAAI) , 2024
[Paper] [Code]
Aligning Bag of Regions for Open-Vocabulary Object Detection
Size Wu, Wenwei Zhang, Sheng Jin, Wentao Liu, Chen Change Loy
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2023
[Paper] [Code] [Project Page]
Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images
Size Wu, Sheng Jin, Wentao Liu, Lei Bai, Chen Qian, Dong Liu, and Wanli Ouyang
International Conference on Computer Vision (ICCV) , 2021
[Paper] [Code]
Papers
Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Zhonghua Wu, Qingyi Tao, Wentao Liu, Wei Li and Chen Change Loy
International Conference on Computer Vision (ICCV) , 2025
[Paper] [Code]
Size Wu, Sheng Jin, Wenwei Zhang, Lumin Xu, Wentao Liu, Wei Li and Chen Change Loy
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2025
[Paper] [Code]
Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Xiangtai Li, Wentao Liu and Chen Change Loy
International Conference on Learning Representations (ICLR) , 2024
Spotlight (top 5%)
[Paper] [Code]
Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Wentao Liu and Chen Change Loy
Association for the Advancement of Artificial Intelligence (AAAI) , 2024
[Paper] [Code]
Size Wu, Wenwei Zhang, Sheng Jin, Wentao Liu, Chen Change Loy
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2023
[Paper] [Code] [Project Page]
Size Wu, Sheng Jin, Wentao Liu, Lei Bai, Chen Qian, Dong Liu, and Wanli Ouyang
International Conference on Computer Vision (ICCV) , 2021
[Paper] [Code]
