You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
β οΈ IMPORTANT NOTICE β οΈ : Both stage-I and stage-II are trained with long prompts only. For achieving the best results, include comprehensive and detailed descriptions in your prompts, akin to the example provided in example.txt.
Jupyter Notebook
You can conveniently provide user prompts in our Jupyter notebook. The default configuration for spatial and temporal slices in the VAE Decoder is tailored for an 80G GPU. For GPUs with less memory, one might consider increasing the spatial and temporal slice.
flashvideo/demo.ipynb
Inferring from a Text File Containing Prompts
You can conveniently provide the user prompt in a text file and generate videos with multiple gpus.
bashinf_270_1080p.sh
License
This project is developed based on CogVideoX. Please refer to their original license for usage details.
BibTeX
@article{zhang2025flashvideo,
title={FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation},
author={Zhang, Shilong and Li, Wenbo and Chen, Shoufa and Ge, Chongjian and Sun, Peize and Zhang, Yida and Jiang, Yi and Yuan, Zehuan and Peng, Binyue and Luo, Ping},
journal={arXiv preprint arXiv:2502.05179},
year={2025}
}
About
[AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation