Welcome to the repository for Ultra-low Bitrate Video Compression Using Deep Animation Models. This codebase implements methods and models described in cutting-edge research on low-bitrate video conferencing and animation-based video compression. The repository is designed to serve researchers and developers interested in leveraging deep learning for video compression.
This repository accompanies the following papers:
- Ultra-Low Bitrate Video Conferencing Using Deep Image Animation
- A Hybrid Deep Animation Codec for Low-Bitrate Video Conferencing
- Improving Reconstruction Fidelity in Generative Face Video Coding Using High-Frequency Shuttling
- Predictive Coding for Animation-Based Video Compression
- Improved Predictive Coding for Animation-Based Video Compression
- Multi-Reference Generative Face Video Compression with Contrastive Learning
The codebase requires Python 3. To set up the environment, clone the repository and install the required dependencies:
pip install -r requirements.txt
The YAML configuration files define the settings for training and testing the models. Example files are located in the `train_config` and `test_config` directories:
- `[train/test]_config/dac.yaml`
- `[train/test]_config/hdac.yaml`
- `[train/test]_config/rdac.yaml`

During inference, use the `--mode test` flag with the same configuration file after updating the `eval_params` section appropriately.
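The `eval_params` section can also be inspected or edited programmatically before a test run. The sketch below is a minimal illustration assuming PyYAML is installed; the key it sets (`num_frames`) is a hypothetical placeholder, so check the actual schema in the `test_config` files before relying on it.

```python
# Minimal sketch: inspect and update the eval_params section of a test config.
# Assumes PyYAML is installed; the "num_frames" key is a hypothetical placeholder --
# consult the actual schema in test_config/*.yaml.
import yaml

config_path = "test_config/dac.yaml"

with open(config_path, "r") as f:
    config = yaml.safe_load(f)

print("Current eval_params:", config.get("eval_params"))

# Hypothetical edit: limit evaluation to the first 128 frames of each sequence.
config["eval_params"] = config.get("eval_params") or {}
config["eval_params"]["num_frames"] = 128

with open(config_path, "w") as f:
    yaml.safe_dump(config, f, sort_keys=False)
```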
- VoxCeleb: Follow the instructions in the video-preprocessing repository to prepare the dataset.
- Creating Your Own Videos: Ensure that input videos are cropped to focus on the speaker’s face at a resolution of 256x256 pixels (see the preprocessing sketch after this list). Support for higher resolutions is under development.
- Pre-processed Videos (256x256 px): Available for download from our Google Drive link. Place these videos in the following folders:
  - `datasets/train`
  - `datasets/inference`
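When creating your own videos, a rough face-centered crop followed by a resize to 256x256 can be scripted with OpenCV, as in the minimal sketch below. This is only an illustration, not the repository's preprocessing pipeline, and the fixed crop box is a hypothetical placeholder that would normally come from a face detector.

```python
# Minimal sketch: crop each frame around the speaker and resize to 256x256.
# Illustration only, not the official preprocessing; the crop box below is a
# hypothetical placeholder that you would normally derive from a face detector.
import cv2

def preprocess_video(src_path: str, dst_path: str, crop_box=(0, 0, 720, 720)):
    x, y, w, h = crop_box
    reader = cv2.VideoCapture(src_path)
    fps = reader.get(cv2.CAP_PROP_FPS) or 25.0
    writer = cv2.VideoWriter(
        dst_path, cv2.VideoWriter_fourcc(*"mp4v"), fps, (256, 256)
    )
    while True:
        ok, frame = reader.read()
        if not ok:
            break
        face = frame[y:y + h, x:x + w]        # crop around the face region
        face = cv2.resize(face, (256, 256))   # match the model input size
        writer.write(face)
    reader.release()
    writer.release()

preprocess_video("raw_talking_head.mp4", "datasets/inference/sample_256.mp4")
```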
Our metrics module incorporates suggestions from JPEG-AI alongside popular quantitative metrics used in computer vision. Supported metrics include `psnr`, `psnr-hvs`, `fsim`, `iw_ssim`, `ms_ssim`, `vif`, `nlpd`, `vmaf`, `lpips`, and `msVGG`.
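As a quick sanity check outside the metrics module, PSNR between a reference and a decoded frame can be computed directly, as in the sketch below. This is an independent illustration of the metric, not the repository's implementation; perceptual metrics such as `lpips` and `vmaf` require their dedicated packages.

```python
# Minimal sketch: PSNR between a reference frame and its reconstruction.
# Independent illustration of the metric, not the repository's metrics module.
import numpy as np

def psnr(reference: np.ndarray, reconstruction: np.ndarray, max_val: float = 255.0) -> float:
    mse = np.mean((reference.astype(np.float64) - reconstruction.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10((max_val ** 2) / mse)

# Stand-in frames: a random reference and a lightly perturbed reconstruction.
ref = np.random.randint(0, 256, (256, 256, 3), dtype=np.uint8)
dec = np.clip(ref + np.random.randint(-5, 6, ref.shape), 0, 255).astype(np.uint8)
print(f"PSNR: {psnr(ref, dec):.2f} dB")
```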
To train a model, update the relevant parameters in the corresponding `train_config/[MODEL_NAME].yaml` file or use the default configuration (to reproduce our results), then run the following command:
bash training_script.sh [MODEL_NAME]
Note: The default setup requires 2 x A40 GPUs. Adjust the batch size in the configuration file if using a different hardware setup.
To test a model, update the `eval_params` section in the corresponding `test_config/[MODEL_NAME].yaml` file and run:
bash test_script.sh [MODEL_NAME]
Refer to JVET-AH0114, its subsequent documentation, and the reference software for the common test conditions (CTC) implementation and for benchmark evaluation against other generative face video coding (GFVC) frameworks.
This codebase includes components adapted from the following projects:
- First Order Motion Model for Image Animation: For the base architecture of deep image animation using unsupervised keypoints.
- CompressAI: For learned image compression.
- JPEG-AI: For evaluation metrics.
For any questions, feedback, or collaboration opportunities, feel free to contact the maintainers or open an issue in this repository.
We appreciate the contributions of the research community that enabled this work. If you use this repository or find it helpful, please consider citing the relevant papers.
If you find this project useful, give it a star on GitHub to support further development!