Exporters From Japan
Wholesale exporters from Japan   Company Established 1983
CARVIEW
Select Language

Selected Works ([Full List])

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang*, Jifeng Dai*, Zhe Chen*†, Zhenhang Huang* Zhiqi Li*†, Xizhou Zhu*, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao#
CVPR, 2023 (Highlight Paper (2.5%))
[Paper] [Code] [BibTex]
A strong large-scale CNN-based fondamention model.
Vision Transformer Adapter for Dense Predictions
Zhe Chen*†, Yuchen Duan*†, Wenhai Wang#, Junjun He, Tong Lu#, Jifeng Dai, Yu Qiao
ICLR, 2023 (Spotlight Paper (8.0%))
[Paper] [Code] [BibTex]
We design a ViT adapter for dense prediction tasks.
BEVFormer: Learning Bird’s-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Zhiqi Li*†, Wenhai Wang*, Hongyang Li*, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai#
ECCV, 2022
[Paper] [Code] [BibTex]
[ECCV 2022' Top-10 Influential Papers]
[100 Most Cited AI Papers in 2022]
A versatile camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
PVT v2: Improved Baselines with Pyramid Vision Transformer
Wenhai Wang#, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao
CVMJ, 2021 (ESI Highly Cited Paper (1%), ESI Hot Paper (0.1%))
[Paper] [Code] [中文解读] [Report] [Talk] [BibTex]
[CNKI's Academic Essentials]
[CVMJ 2022 Honorable Mention Award]
A better PVT.
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan#, Kaitao Song, Ding Liang, Tong Lu#, Ping Luo, Ling Shao
ICCV, 2021 (Oral Presentation (3.4%))
[Paper] [Code] [中译版] [中文解读] [Report] [Talk] [BibTex]
[ICCV21' Top-10 Influential Papers]
A pure Transformer backbone for dense prediction, such as object detection and semantic segmentation.
PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond
Enze Xie*, Wenhai Wang*, Mingyu Ding, Ruimao Zhang, Ping Luo#
TPAMI, 2021
[Paper] [Code] [BibTex]
[CVPR 2020 Top-10 Influential Papers]
We extend PolarMask (CVPR 2020 Oral Presentation (5.7%)) to several instance-level detection tasks.
PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text
Wenhai Wang*, Enze Xie*, Xiang Li, Xuebo Liu, Ding Liang, Zhibo Yang, Tong Lu#, Chunhua Shen
TPAMI, 2021
[Paper] [Code1] [Code2] [BibTex]
We extend PSENet (CVPR 2019) and PAN (ICCV 2019) to a text spotting system.

Honors and Awards

Invited Talk

  • 2023/10: Preliminary Study on "Large-Scale Visual Foundation Model + LLM in Open-World Application", PRCV Talk.
  • 2023/08: Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions, WAIC Youth Outstanding Paper Award Talk
  • 2023/07-08: Preliminary Study on "Large-Scale Visual Foundation Model + LLM", Zhidx/Huawei Noah's Ark Lab/Tencent Youtu Lab/Fudan University Talk
  • 2023/06: Study and Application of Large-scale Foundation Models in Open World Tasks, VALSE Talk
  • 2023/05: InternImage: A Large-Scale Generic Vision Model, SenseTime Talk
  • 2022/11-12: Study and Application of Multi-Task Generic Perception Model, AITIME (2:31:30)/Tsinghua University Talk
  • 2022/07: Transformer-based Vision Perception, ChinaMM Talk
  • 2021/07: Application of Transformer in Detection and Segmentation Tasks, TechBeat Talk

Academic Services

Workshop (Co-)Organizer
  • Vision and Language Collision: Synergy between Language Model and Vision Ecology (视言碰撞:语言模型与视觉生态协同) at PRCV 2023
  • Challenges and Opportunities of Large Models for CV/PR (大模型对CV/PR的挑战与机会) at VALSE 2023
Associate Editor
  • Visual Intelligence
Senior Program Committee Member
  • International Joint Conference on Artificial Intelligence (IJCAI), 2021
Journal Reviewer
  • IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • International Journal of Computer Vision (IJCV)
  • IEEE Transactions on Image Processing (TIP)
  • IEEE Transactions on Multimedia (TMM)
  • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
  • Computational Visual Media Journal (CVMJ)
  • Pattern Recognition (PR)
Program Committee Member/Conference Reviewer
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020, 2021, 2022, 2023
  • Neural Information Processing Systems (NeurIPS), 2020, 2021, 2023
  • International Conference on Machine Learning (ICML), 2021, 2022
  • International Conference on Learning Representations (ICLR), 2021
  • IEEE International Conference on Computer Vision (ICCV), 2021
  • European Conference on Computer Vision (ECCV), 2022
  • AAAI Conference on Artificial Intelligence (AAAI), 2022
  • International Joint Conference on Artificial Intelligence (IJCAI), 2022
  • IEEE Winter Conference on Applications of Computer Vision (WACV), 2021
  • Asian Conference on Computer Vision (ACCV), 2020