You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Aug 24, 2020. It is now read-only.
A project for research in text detection and recognition using PyTorch 1.2.
This project is originated from the research repo, which heavily relies on closed-source libraries, of CSG-Algorithm team of Megvii(https://megvii.com).
We are in ongoing progress to transfer models into this repo gradually, released implementations are listed in Progress.
Highlights
Implementations of representative text detection and recognition methods.
An effective framework for conducting experiments: We use yaml files to configure experiments, making it convenient to take experiments.
Thorough logging features which make it easy to follow and analyze experimental results.
CPU/GPU compatible for training and inference.
Distributed training support.
Install
Requirements
pip install -r requirements.txt
Python3.7
PyTorch 1.2 and CUDA 10.0.
gcc 5.5(Important for compiling)
Compile cuda ops (If needed)
cd PATH_TO_OPS
python setup.py build_ext --inplace
ops may be used:
DeformableConvV2 assets/ops/dcn
CTC2DLoss ops/ctc_2d
Configuration(optional)
Edit configurations in config.py.
Training
See detailed options: python3 train.py --help
Datasets
We provide data loading implementation with annotation packed with json for quick start.
Also, lmdb format data are now available too.
You can refer the usage in demo.
Datasets used in our recognition experiments can be downloaded from onedrive. The transform script are provide to convert json format data to lmdb.