You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is a summary of easily available, high-quality datasets consisiting of captioned image files for generalized DALLE-pytorch training (https://github.com/lucidrains/DALLE-pytorch).
The scripts help you download and resize the files from the given sources.
general datasets
Conceptual Images 12m
Wikipedia
Filtered yfcc100m
Open Images
specific datasets
None yet
Helper scripts
All helper scripts can be found in the utilities folder now:
TFrecords to WebDataset converter
Image-Text-Folder to WebDataset converter
Dataset sanitycheck for image-text-files
Example reader for WebDataset files
Sanitycheck for downloaded datasets
The following command will look for image-text-pairs (.jpg / .png / .bmp) and return a csv table with incomplete data.
When you add the optional argument -DEL, the incomplete files get deleted. The python scripts checks one folder and the first subdirectories.
Special thanks go to Romaine, who improved the download scripts and made the great WebDataset format more accessible with his continuous coding efforts! 🙏