Digital Library of the European Council for Modelling and Simulation

Title:

Creating Training Datasets For OCR In Mobile Device Video Stream

Authors:

Dmitry A. Ilin, Valeriy E. Krivtsov

Published in:

(2015). ECMS 2015 Proceedings edited by: Valeri M. Mladenov, Grisha Spasov, Petia Georgieva, Galidiya Petrova, European Council for Modeling and Simulation. doi:10.7148/2015

ISBN: 978-0-9932440-0-1

29th European Conference on Modelling and Simulation,

Albena (Varna), Bulgaria, May 26th – 29th, 2015

Citation format:

Dmitry A. Ilin, Valeriy E. Krivtsov (2015). Creating Training Datasets For OCR In Mobile Device Video Stream, ECMS 2015 Proceedings edited by: Valeri M. Mladenov, Petia Georgieva, Grisha Spasov, Galidiya Petrova, European Council for Modeling and Simulation. doi:10.7148/2015-0516

DOI:

http://dx.doi.org/10.7148/2015-0516

Abstract:

This paper studies data sampling methods for training convolutional neural networks for character recognition. The methods are considered in the context of optical character recognition of the machine-readable zone (MRZ) of documents captured by a mobile phone camera. Advantages and disadvantages of training on natural and artificial datasets are discussed. We describe a set of image transformations and give examples of their practical implementation. Finally, we show how adding artificial examples to the training database, generated according to an analysis of recognition errors, improves recognition quality.

Full text: