You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A list of modern (convolutional) network architectures for vision. Note that we only list the works based on convolution, modulation or other variants that emerge most recently. Please refer to other more comprehensive lists about networks using attention or MLP-style designs.
Since it is a new trend, so feel free to submit a pull request or raise an issue if you find any missed papers!
On the Connection between Local Attention and Dynamic Depth-wise Convolution. ICLR 2022.
Qi Han, Zejia Fan, Qi Dai, Lei Sun, Ming-Ming Cheng, Jiaying Liu, Jingdong Wang.
Release date: 8 June 2021.
[paper][code]
MetaFormer Is Actually What You Need for Vision. CVPR 2022.
Weihao Yu, Mi Luo, Pan Zhou, Chenyang Si, Yichen Zhou, Xinchao Wang, Jiashi Feng, Shuicheng Yan.
Release date: 22 Nov 2021.
[paper][code]
A ConvNet for the 2020s. CVPR 2022.
Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell, Saining Xie.
Release date: 10 Jan 2022.
[paper][code]
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition. arXiv 2022.
Qibin Hou, Cheng-Ze Lu, Ming-Ming Cheng, Jiashi Feng.
Release date: 22 Nov 2022.
[paper][code]
A Close Look at Spatial Modeling: From Attention to Convolution. arXiv 2022.
Xu Ma, Huan Wang, Can Qin, Kunpeng Li, Xingchen Zhao, Jie Fu, Yun Fu.
Release date: 23 Dec 2022.
[paper][code]
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders. arXiv 2023.
Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu, In So Kweon, Saining Xie.
Release date: 2 Jan 2023.
[paper][code]
Scaling up Kernels in 3D CNNs. arXiv 2022.
Yukang Chen, Jianhui Liu, Xiaojuan Qi, Xiangyu Zhang, Jian Sun, Jiaya Jia.
Release date: 21 June 2022.
[paper][code]
Long Range Pooling for 3D Large-Scale Scene Understanding. arXiv 2023.
Xiang-Li Li, Meng-Hao Guo, Tai-Jiang Mu, Ralph R. Martin, Shi-Min Hu
Release date: 17 Jan 2023.
[paper][code]
Others
LKD-Net: Large Kernel Convolution Network for Single Image Dehazing. arXiv 2022.
Pinjun Luo, Guoqiang Xiao, Xinbo Gao, Song Wu.
Release date: 5 Sep 2022.
[paper][code]