Drop-DTW: Aligning Common Signal Between Sequences While Dropping Outliers
This is the official PyTorch implementation of Drop-DTW [1] (published at NeurIPS'21). The code includes the core Drop-DTW algorithm as well as the step localization experiments on the COIN dataset [2].
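For intuition, here is a minimal NumPy sketch of the kind of dynamic program Drop-DTW builds on: DTW over a step-vs-clip cost matrix, extended with the option to drop a clip at a fixed cost instead of matching it. The function name, interface, and simplified one-sided recursion below are illustrative assumptions, not the repo's implementation (which also supports percentile-based and learnable drop costs and a differentiable variant used as a training loss); see the code in this repo and the paper [1] for the exact algorithm.

```python
# Simplified, one-sided illustration of drop-augmented DTW (not the exact
# recursion from the paper): steps must be matched in order to contiguous
# runs of clips, and any clip may instead be dropped at a fixed cost.
import numpy as np

def drop_dtw_cost(C, drop_cost):
    """C: (N, M) match-cost matrix between N steps (rows) and M clips (columns).
    drop_cost: scalar cost of declaring a clip an outlier.
    Returns the optimal alignment cost."""
    N, M = C.shape
    D = np.full((N + 1, M + 1), np.inf)
    D[0, 0] = 0.0
    # With no steps consumed yet, every clip must be dropped.
    for j in range(1, M + 1):
        D[0, j] = D[0, j - 1] + drop_cost
    for i in range(1, N + 1):
        for j in range(1, M + 1):
            match = C[i - 1, j - 1] + min(D[i - 1, j - 1],  # clip j starts step i
                                          D[i, j - 1])      # clip j continues step i
            drop = drop_cost + D[i, j - 1]                   # clip j is an outlier
            D[i, j] = min(match, drop)
    return D[N, M]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    costs = rng.random((3, 10))  # 3 steps vs. 10 clips
    print(drop_dtw_cost(costs, drop_cost=0.3))
```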
Set up the data
(a) Download the pre-extracted features for the COIN dataset by running download_coin_features.sh from the root folder of the project (see the example command after option (b) below). The features were extracted with the S3D network pretrained on HowTo100M [3];
OR
(b) If you prefer to extract the features yourself rather than use the pre-extracted ones, follow the instructions in video_encoding/. This replaces step (a).
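For option (a), assuming a bash-compatible script, the invocation from the project root would look like:
bash download_coin_features.sh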
In the terminal where you are going to run training/testing, run the following command first:
ulimit -n 5000
This raises the limit on simultaneously open files to 5000, which is needed for the data loader to function properly.
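You can check the current limit by running ulimit -n with no argument; it prints the current value.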
Train the network
To train a feature mapping with the Drop-DTW loss (using the 0.3-percentile drop cost), run the following command:
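The exact command is not reproduced here; a plausible invocation, assuming train.py accepts a --name flag analogous to evaluate.py below, is:
python3 train.py --name=my_model   # hypothetical invocation; see train.py for the exact arguments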
Inspect train.py for additional training configurations, such as network architecture changes, a learnable drop cost, and more.
Step localization inference
To test your model's ability to perform step localization on the COIN dataset, run the following command:
python3 evaluate.py --name=my_model
You can switch the inference method from Drop-DTW to other alignment algorithms and alter other testing settings using flags. Please refer to evaluate.py for more details.
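If evaluate.py defines its options with argparse (a typical setup, not verified here), running python3 evaluate.py --help lists the available flags.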
References
[1] Dvornik et al. "Drop-DTW: Aligning Common Signal Between Sequences While Dropping Outliers." NeurIPS'21.
[2] Tang et al. "COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis." CVPR'19.
[3] Miech et al. "End-to-end Learning of Visual Representations from Uncurated Instructional Videos." CVPR'20.