[2024.09.19] Checkpoints have been migrated to Hugging Face. You can obtain weights from VideoMAEv2-hf.
[2023.05.29] VideoMAE V2-g features for THUMOS14 and FineAction datasets are available at TAD.md now.
[2023.05.11] We have supported testing of our distilled models at MMAction2 (dev version)! See PR#2460.
[2023.05.11] The feature extraction script for TAD datasets has been released! See instructions at TAD.md.
[2023.04.19] ViT-giant model weights have been released! You can get the download links from MODEL_ZOO.md.
[2023.04.18] Code and the distilled models (vit-s & vit-b) have been released!
[2023.04.03] Code and models will be released soon.
Model Zoo
Model weights, including the distilled models, are provided in MODEL_ZOO.md.
If you find this repository useful, please cite it using the following BibTeX entries.
@InProceedings{wang2023videomaev2,
author = {Wang, Limin and Huang, Bingkun and Zhao, Zhiyu and Tong, Zhan and He, Yinan and Wang, Yi and Wang, Yali and Qiao, Yu},
title = {VideoMAE V2: Scaling Video Masked Autoencoders With Dual Masking},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2023},
pages = {14549-14560}
}
@misc{videomaev2,
title={VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking},
author={Limin Wang and Bingkun Huang and Zhiyu Zhao and Zhan Tong and Yinan He and Yi Wang and Yali Wang and Yu Qiao},
year={2023},
eprint={2303.16727},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
About
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking