You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[ICLR2021] CT-Net: Channel Tensorization Network for Video Classification
@inproceedings{
li2021ctnet,
title={{\{}CT{\}}-Net: Channel Tensorization Network for Video Classification},
author={Kunchang Li and Xianhang Li and Yali Wang and Jun Wang and Yu Qiao},
booktitle={International Conference on Learning Representations},
year={2021},
url={https://openreview.net/forum?id=UoaQUQREMOs}
}
Overview
[2021/6/3] We release the PyTorch code of CT-Net. More details and models will be available.
Model Zoo
All models can be trained on a single machine (e.g., 8 1080Ti). Some tricks will help you save GPU memory, suck as mixed precision or torch.utils.checkpoint.
Some models are lost after hacking by mining malware. If there is any problem about training model, please create an issue or send me an email.
Now we release the model for visualization (Something-Something V1), please download it from here and put it in ./model. (passward: t3to)
Install
pip install -r requirements.txt
Dataset
In our paper, we conduct experiments on Kinetics-400, Something-Something V1&V2, UCF101, and HMDB51. Please refer to TSM repo for the detailed guide of data pre-processing.
Training and Testing
Please refer to scripts/train.sh and scripts/test.sh, more details can be found in the appendix of our paper.
Setting environment
source ./init.sh
Training
We use dense sampling and uniform sampling for Kinetics and Something-Something respecitively.