You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is a rough implementation of the paper. Since I do not have a titan gpu, I made some modifications on the algorithm, but you can easily change them back if you want the exact setting from the paper.
for accurate prediction, please pre-train noise stream's vgg weights on ImageNet and overwrite the trainable setting of noise stream after SRM conv layer
Bounding boxes are predicted by both streams.
In the paper, RGB stream alone predicts bbox more accurately, so you may wanna change that as well (also defined in vgg16.py)
Use main_create_training_set.py to create training set from PASCAL VOC dataset.
The generated dataset will follow the pascal voc style, which is also required by train.py
Tensorboard file will be save at /default
Weights will be save to /default/DIY_detaset/default
Note
The code requires a large memory GPU. If you do not have a 6G+ GPU, please reduce the number of noise stream conv layers for training.
Demo results
Dataset size: 10000, epoch: 3
Finally
I will update this repo a few weeks later after I installed the new GPU
About
Paper: CVPR2018, Learning Rich Features for Image Manipulation Detection