[GSoC] Add digit and text recognition samples. #17675

zihaomu · 2020-06-27T07:49:13Z

Hi, this is my GSoC project to add digit and text recognition samples.

Status Update:

The detailed tutorial of OCR models usage method and how to train your own OCR model have been added to the doc/tutorials/dnn/dnn_OCR/dnn_OCR.markdown.

1. digit recognition

Take the live image from the camera, use connected component analysis to detect potential regions with each digit, and use the LeNet to classify.
With CPU only (i5-8300), it can achieve 12 FPS.

2. scene text recognition

My laptop environment is CPU: i5-8300, GPU: 1050, Ubuntu 18
Take the live image from the camera, use EAST as text detector. After getting the detector output, crop these bounding box as the input of the text recognizer based on VGG Net. Finally, print the result near the box. Using GPU can achieve around 9FPS.

After loading the model into OpenCV, test the performance of the text recognition model on different data sets. And the result is filled in following table:

Model name	IIIT5k(%)	SVT(%)	ICDAR03(%)	ICDAR13(%)	ICDAR15(%)	SVTP(%)	CUTE80(%)	average acc (%)	FPS	parameter( x10^6 )
DenseNet-CTC	72.267	67.39	82.814	80	48.387	49.457	42.509	63.260571	134.63	0.239
DenseNet-BiLSTM-CTC	73.767	72.334	86.159	83.153	50.676	57.984	49.826	67.699857	27.59	3.636
VGG-CTC	75.967	75.425	85.928	83.547	54.891	57.519	50.174	69.064429	108.04	5.569
CRNN_VGG-BiLSTM-CTC	82.633	82.071	92.964	88.867	66.285	71.008	62.369	78.028143	31.94	8.452
ResNet-CTC	84	84.08	92.388	88.966	67.742	74.729	67.596	79.928714	15.87	44.283

These pre-trained models can be found here https://drive.google.com/drive/folders/1cTbQ3nuZG-EKWak6emD_s8_hHXWz7lAr?usp=sharing. The FPS in the table is the performance of the text recognition model on my computer, and does not include the text detection model.

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under OpenCV (BSD) License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or other license that is incompatible with OpenCV
The PR is proposed to proper branch
There is reference to original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

samples/dnn/text_detection_and_recognition.cpp

samples/cpp/digits.cpp

alalek · 2020-06-29T08:08:38Z

I have closed the previous PR (#17462) which contains LeNet_digit recognition only.

Users may want to compare different algorithms/approaches on their tasks.

zihaomu · 2020-06-30T06:43:02Z

I have closed the previous PR (#17462) which contains LeNet_digit recognition only.

Users may want to compare different algorithms/approaches on their tasks.

Hi, I keep the original digits.cpp as digits_SVM.cpp, and add the proposal as digits_LeNet.cpp.

samples/dnn/text_detection.cpp

zihaomu

I modified the original font size, the previous one is too big to affect the display effect.

samples/dnn/text_detection.cpp

vpisarev · 2020-07-29T21:18:58Z

@zihaomu, thank you very much! I've tested your code and it works great!

Can you, please, also update text_detection.py to use the same model for detection and add OCR part?

samples/cpp/digits_LeNet.cpp

zihaomu · 2020-07-31T11:09:00Z

@zihaomu, thank you very much! I've tested your code and it works great!

Can you, please, also update text_detection.py to use the same model for detection and add OCR part?

@vpisarev Thank you for your reply.
Indeed, the implementation of text_detection.py does have errors. In order not to affect this PR of GSoC, I have created a new PR #17992.

samples/dnn/text_detection.cpp

samples/cpp/digits_LeNet.cpp

samples/dnn/text_detection.py

samples/cpp/digits_LeNet.cpp

samples/dnn/text_detection.py

doc/tutorials/dnn/dnn_OCR/dnn_OCR.markdown

dkurt · 2020-08-21T14:03:35Z

It seems to me ready for merge. May I ask to squash all the commits into one?

…le OCR models.

dkurt

👍 Thank you!

zihaomu mentioned this pull request Jun 27, 2020

Add dnn based digits recognition samples #17462

Closed

6 tasks

dkurt reviewed Jun 28, 2020

View reviewed changes

samples/dnn/text_detection_and_recognition.cpp Outdated Show resolved Hide resolved

dkurt reviewed Jun 28, 2020

View reviewed changes

samples/cpp/digits.cpp Outdated Show resolved Hide resolved

zihaomu requested a review from dkurt June 30, 2020 07:00

dkurt reviewed Jun 30, 2020

View reviewed changes

samples/dnn/text_detection.cpp Show resolved Hide resolved

zihaomu commented Jul 15, 2020

View reviewed changes

samples/dnn/text_detection.cpp Outdated Show resolved Hide resolved

vpisarev reviewed Jul 29, 2020

View reviewed changes

samples/cpp/digits_LeNet.cpp Outdated Show resolved Hide resolved

dkurt reviewed Jul 31, 2020

View reviewed changes

samples/dnn/text_detection.cpp Outdated Show resolved Hide resolved

dkurt reviewed Jul 31, 2020

View reviewed changes

samples/dnn/text_detection.cpp Show resolved Hide resolved

dkurt reviewed Jul 31, 2020

View reviewed changes

samples/cpp/digits_LeNet.cpp Outdated Show resolved Hide resolved

dkurt reviewed Jul 31, 2020

View reviewed changes

samples/dnn/text_detection.py Show resolved Hide resolved

dkurt reviewed Aug 4, 2020

View reviewed changes

samples/cpp/digits_LeNet.cpp Outdated Show resolved Hide resolved

dkurt reviewed Aug 4, 2020

View reviewed changes

samples/dnn/text_detection.py Outdated Show resolved Hide resolved

dkurt reviewed Aug 21, 2020

View reviewed changes

doc/tutorials/dnn/dnn_OCR/dnn_OCR.markdown Outdated Show resolved Hide resolved

dkurt reviewed Aug 21, 2020

View reviewed changes

doc/tutorials/dnn/dnn_OCR/dnn_OCR.markdown Outdated Show resolved Hide resolved

dkurt reviewed Aug 21, 2020

View reviewed changes

doc/tutorials/dnn/dnn_OCR/dnn_OCR.markdown Show resolved Hide resolved

dkurt reviewed Aug 21, 2020

View reviewed changes

doc/tutorials/dnn/dnn_OCR/dnn_OCR.markdown Outdated Show resolved Hide resolved

add OpenCV sample for digit and text recongnition, and provide multip…

397ba2d

…le OCR models.

zihaomu force-pushed the GSoC_digit_text_detect_and_recog branch from 5f7eef9 to 397ba2d Compare August 21, 2020 17:10

dkurt approved these changes Aug 21, 2020

View reviewed changes

dkurt self-assigned this Aug 21, 2020

alalek merged commit 3547ac4 into opencv:master Aug 22, 2020

zihaomu changed the title ~~[GSoC] Add digit and text recongnition samples.~~ [GSoC] Add digit and text recognition samples. Sep 8, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[GSoC] Add digit and text recognition samples. #17675

[GSoC] Add digit and text recognition samples. #17675

Uh oh!

zihaomu commented Jun 27, 2020 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

alalek commented Jun 29, 2020

Uh oh!

zihaomu commented Jun 30, 2020

Uh oh!

Uh oh!

zihaomu left a comment

Uh oh!

Uh oh!

vpisarev commented Jul 29, 2020

Uh oh!

Uh oh!

zihaomu commented Jul 31, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dkurt commented Aug 21, 2020

Uh oh!

dkurt left a comment

Uh oh!

Uh oh!

Uh oh!

[GSoC] Add digit and text recognition samples. #17675

[GSoC] Add digit and text recognition samples. #17675

Uh oh!

Conversation

zihaomu commented Jun 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Status Update:

1. digit recognition

2. scene text recognition

Pull Request Readiness Checklist

Uh oh!

Uh oh!

Uh oh!

alalek commented Jun 29, 2020

Uh oh!

zihaomu commented Jun 30, 2020

Uh oh!

Uh oh!

zihaomu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

vpisarev commented Jul 29, 2020

Uh oh!

Uh oh!

zihaomu commented Jul 31, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dkurt commented Aug 21, 2020

Uh oh!

dkurt left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

zihaomu commented Jun 27, 2020 •

edited

Loading