You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Before starting the training process, ensure the dataset is correctly prepared. Please download the MSRVTT dataset locally first, you can get it through this link MRSVTT.
Then use the following script to organize and preprocess your data for training:
python make_dataset/msrvtt-depth-map.py
This script handles dataset cleaning, and formatting to meet the training requirements.
2. Training ControlNet (train_controlnet)
To train ControlNet using a single GPU, you can directly run the following script:
bash train_controlnet.sh
For multi-GPU training, make the necessary modifications in train_controlnet.sh, and refer to the example below:
To accelerate web connections, the train_ddpm.sh script uses proxychains. If you encounter connection issues, verify your proxy settings. For multi-GPU training, you can modify the script and use a similar setup as shown in the ControlNet example above.
Visualization
This section shows some visualization results on downstream algorithms.
We provide a comparison of the effects of different timesteps of the training and inference process for you to pick the desired parameters.