Image Captioning and Text-to-Image Synthesis with textual data augmentation
This code runs well under Python 2.7 and TensorFlow 0.11. If you use a higher version of TensorFlow, you may need to update the tensorlayer folder from the TensorLayer library.
Usage
1. Prepare MSCOCO data and Inception model
Before you run the scripts, follow Google's setup guide, and set up the model, ckpt and data directories in *.py.
Create a data folder.
Download and preprocess the MSCOCO data (click here).
2. Train your image captioning model on MSCOCO by following my other repo.
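Before moving on, it can help to confirm the data layout from the steps above is in place. This is a hypothetical helper, not part of the repo; the base directory and sub-paths are example values:

```python
import os

def find_missing(base_dir, expected):
    """Return the expected sub-paths that do not exist under base_dir."""
    return [p for p in expected
            if not os.path.exists(os.path.join(base_dir, p))]

if __name__ == "__main__":
    # Example paths only; adjust to your own MSCOCO location.
    missing = find_missing("/home/user/mscoco",
                           ["raw-data/train2014", "word_counts.txt"])
    for path in missing:
        print("missing: %s" % path)
```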
3. Setup your paths
In train_im2txt2im_coco_64.py:
# Config your image directory.
images_train_dir = '/home/.../mscoco/raw-data/train2014/'
# Config the vocabulary and model directory of your image captioning module.
DIR = "/home/..."
# Directory containing model checkpoints.
CHECKPOINT_DIR = DIR + "/model/train"
# Vocabulary file generated by the preprocessing script.
VOCAB_FILE = DIR + "/data/mscoco/word_counts.txt"
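As a minimal sketch of how VOCAB_FILE is consumed, assuming the im2txt preprocessing script's format of one "word count" pair per line (the sample lines below are illustrative, not real file contents):

```python
def load_vocab(lines):
    """Map each word to an integer id in file order; also keep its count."""
    vocab, counts = {}, {}
    for idx, line in enumerate(lines):
        word, count = line.split()
        vocab[word] = idx
        counts[word] = int(count)
    return vocab, counts

# Example with inline lines instead of open(VOCAB_FILE).
sample = ["</S> 0", "<S> 0", "a 969108"]
vocab, counts = load_vocab(sample)
print(vocab["a"], counts["a"])
```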
4. Train text-to-image synthesis with image captioning
model_im2txt.py : model for image captioning
train_im2txt2im_coco_64.py : script for training I2T2I
utils.py : script for utility functions
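The core idea of the textual data augmentation can be sketched as follows: the captioning module produces several candidate captions per training image, and the text-to-image model samples one of them at each step instead of always using the single ground-truth caption. Here `generate_captions` is a placeholder stand-in for the real im2txt decoder, not the repo's API:

```python
import random

def generate_captions(image_id, num=4):
    # Placeholder: the real im2txt model decodes captions from image features.
    return ["caption %d for image %s" % (i, image_id) for i in range(num)]

def sample_text_batch(image_ids, rng=random):
    """Pick one synthetic caption per image for the current training step."""
    return [rng.choice(generate_captions(img)) for img in image_ids]
```

Sampling a different caption each step exposes the text-to-image model to more varied text for the same image, which is the augmentation effect the paper exploits.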
Results
1. Here are some results on MSCOCO
2. Transfer learning on MHP dataset
Citation
If you find this work useful, please cite:
@inproceedings{hao2017im2txt2im,
  title={I2T2I: Learning Text to Image Synthesis with Textual Data Augmentation},
  author={Dong, Hao and Zhang, Jingqing and McIlwraith, Douglas and Guo, Yike},
  booktitle={ICIP},
  year={2017}
}
About
I2T2I: Text-to-Image Synthesis with textual data augmentation