This repository contains code for Scene Graph Parsing as Dependency Parsing, NAACL 2018.
If you use the code, please cite:
@inproceedings{wang2018sgparser,
  title={Scene Graph Parsing as Dependency Parsing},
  author={Wang, Yu-Siang and Liu, Chenxi and Zeng, Xiaohui and Yuille, Alan},
  booktitle={NAACL},
  year={2018}
}
- python 2.7
- PyTorch v0.3
./model
- training code
./preprocess
- scripts that map the json data to CoNLL files
./coco-caption
- COCO caption evaluation with the dependency scene graph parser
For training (development) data, we use the images that appear in both the Visual Genome training (development) set and the MSCOCO training (development) set. You can check the coco_id field in image_data.json.zip to verify whether an image belongs to this overlap.
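This overlap check can be sketched as follows. It is a minimal illustration, assuming each entry of image_data.json carries an image_id field and a coco_id field that is null for images not in MSCOCO; the sample values below are made up:

```python
import json  # image_data.json entries are plain JSON objects

def coco_overlap_ids(image_data):
    """Return the Visual Genome image ids whose image also appears
    in MSCOCO, i.e. entries whose coco_id field is not null."""
    return {img["image_id"] for img in image_data if img.get("coco_id") is not None}

# Tiny inline sample mimicking image_data.json entries (illustrative values only).
sample = [
    {"image_id": 1, "coco_id": 532398},  # in the VG/MSCOCO overlap
    {"image_id": 2, "coco_id": None},    # VG-only image
]
print(coco_overlap_ids(sample))  # -> {1}
```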
- Split the Visual Genome files image_data.json.zip, region_graphs.json.zip, and attributes.json.zip into 10 pieces each, name the pieces x_%num where x={image_data, all_region_graphs, all_attributes} and num={0..9} (each of the first 9 pieces contains all_region_graphs.size()/10 entries), and put them into preprocess/. Splitting into 10 pieces speeds up preprocessing.
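The splitting step above can be sketched like this. It is a hypothetical helper, not a script shipped with the repository; the commented lines show how the x_%num naming convention would be applied:

```python
import json

def split_json_list(items, num_pieces=10):
    """Split a list into num_pieces chunks: the first num_pieces-1 chunks
    each get len(items) // num_pieces entries, the last gets the rest."""
    size = len(items) // num_pieces
    pieces = [items[i * size:(i + 1) * size] for i in range(num_pieces - 1)]
    pieces.append(items[(num_pieces - 1) * size:])
    return pieces

# Example usage, writing pieces named image_data_0 .. image_data_9:
# with open("image_data.json") as f:
#     items = json.load(f)
# for num, piece in enumerate(split_json_list(items)):
#     with open("preprocess/image_data_%d" % num, "w") as out:
#         json.dump(piece, out)
```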
- Set the nltk wordnet path on line 10 of data_to_conll.py
run
bash ./preprocess.sh
then run
python split_preprocess.py NUM TARGET
where NUM={0..9} and TARGET={coco_train, coco_dev}, i.e., 20 commands in total. We strongly suggest opening multiple terminals and running these commands in parallel to reduce preprocessing time.
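As an alternative to multiple terminals, the 20 commands can be launched in parallel from one Python script. This is a sketch using the standard subprocess module (run_parallel is a hypothetical helper; run it from inside preprocess/):

```python
import subprocess

def run_parallel(commands):
    """Launch each command as its own process, wait for all of them,
    and return their exit codes."""
    procs = [subprocess.Popen(cmd) for cmd in commands]
    return [p.wait() for p in procs]

# All 20 split_preprocess.py invocations: NUM={0..9} x TARGET={coco_train, coco_dev}.
commands = [
    ["python", "split_preprocess.py", str(num), target]
    for target in ("coco_train", "coco_dev")
    for num in range(10)
]
# run_parallel(commands)  # uncomment inside preprocess/ to run all 20 at once
```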
Finally run
python split.py coco_train merge
python split.py coco_dev merge
python data_to_conll.py --input pre_coco_train.json --output coco_train.conll --train True
python data_to_conll.py --input pre_coco_dev.json --output coco_dev.conll --train False
- --input: processed data generated by split.py
- --output: desired output file
- --train: whether the file is used for training
cd ./model/
bash run.sh
In run.sh, you can set the parameters:
time CUDA_VISIBLE_DEVICES=2 python src/parser.py --outdir ./output --train coco_train.conll --dev coco_dev.conll --epochs 30 --lstmdims 256 --lstmlayers 2 --k 3 --usehead --userl
Please ignore the .txt files; they are not used.
cd ./model/src/utils/evaluation_script/
bash eval.sh
In eval.sh, pre_coco_dev.json is the ground truth file.
time python spice_eval.py pre_coco_dev.json output/predict_coco.conll
Remember to set the nltk_wordnet path in ./model/src/utils/evaluation_script/spice_eval.py so that WordNet is available for the SPICE metric.
python src/parser.py --predict --outdir [results directory] --test test.conll [--extrn extrn.vectors] --model [trained model file] --params [param file generate during training]
Our pretrained model is also available at https://gitlab.com/Yusics/SG_weights