|
Digital Library of the
European Council for Modelling and Simulation |
Title: |
Creating Training Datasets For OCR In Mobile Device Video Stream |
Authors: |
Dmitry A. Ilin, Valeriy E. Krivtsov |
Published in: |
(2015).ECMS 2015 Proceedings edited
by: Valeri M. Mladenov, Grisha Spasov, Petia Georgieva, Galidiya Petrova, European
Council for Modeling and Simulation. doi:10.7148/2015 ISBN:
978-0-9932440-0-1 29th
European Conference on Modelling and Simulation, Albena (Varna), Bulgaria,
May 26th – 29th,
2015 |
Citation
format: |
Dmitry
A. Ilin, Valeriy E. Krivtsov (2015).
Creating Training Datasets For OCR In
Mobile Device Video Stream, ECMS 2015 Proceedings edited by: Valeri M. Mladenov, Petia Georgieva, Grisha Spasov, Galidiya Petrova European Council for Modeling and Simulation. doi:10.7148/2015-0516 |
DOI: |
http://dx.doi.org/10.7148/2015-0516 |
Abstract: |
This paper studies methods of data
sampling for training of convolutional neural
networks for character recognition. These methods are considered for optical character
recognition of machine readable zone (MRZ) of
documents captured by a mobile phone camera. Advantages and disadvantages of
training on natural and artificial datasets are discussed. In this paper we
describe some set of image transformations and give examples of their
practical implementation. At the end we show how adding artificial examples
(to the training database), generated according to the analysis of
recognition error, improves the quality of recognition. |
Full
text: |