You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Modeling Image Composition for Complex Scene Generation
Official PyTorch implementation of of TwFA. Modeling Image Composition for Complex Scene Generation (CVPR2022)
Zuopeng Yang, Daqing Liu, Chaoyue Wang, Jie Yang, Dacheng Tao
Create a symlink data/coco containing the images from the 2017 split in
train2017 and val2017, and their annotations in annotations. Files can be
obtained from the COCO webpage.
VG
Create a symlink data/vg containing the images from Visual Genome. Files can be
obtained from the VG webpage. Unzip the other annotations for VG in the dir data.
Sampling
COCO
Download the checkpoint (code: 5ipt) and place it into the dir pretrained/checkpoints. Then run the command:
@misc{esser2020taming,
title={Taming Transformers for High-Resolution Image Synthesis},
author={Patrick Esser and Robin Rombach and Björn Ommer},
year={2020},
eprint={2012.09841},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
BibTeX
@inproceedings{yang2022modeling,
title={Modeling image composition for complex scene generation},
author={Yang, Zuopeng and Liu, Daqing and Wang, Chaoyue and Yang, Jie and Tao, Dacheng},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={7764--7773},
year={2022}
}