ecms_neu_mini.png

Digital Library

of the European Council for Modelling and Simulation

 

Title:

Structural Compression Of Document Images With PDF/A

Authors:

Sergey Usilin, Dmitry Nikolaev, Vassili Postnikov

Published in:

 

(2010).ECMS 2010 Proceedings edited by A Bargiela S A Ali D Crowley E J H Kerckhoffs. European Council for Modeling and Simulation. doi:10.7148/2010 

 

ISBN: 978-0-9564944-1-2

 

24th European Conference on Modelling and Simulation,

Simulation Meets Global Challenges

Kuala Lumpur, June 1-4 2010

 

Citation format:

Usilin, S. A., Nikolaev, D. P., & Postnikov, V. V. (2010). Structural Compression Of Document Images With PDF/A. ECMS 2010 Proceedings edited by A Bargiela S A Ali D Crowley E J H Kerckhoffs (pp. 242-246). European Council for Modeling and Simulation. doi:10.7148/2010-0242-0246

DOI:

http://dx.doi.org/10.7148/2010-0242-0246

Abstract:

This paper describes a new compression algorithm of document images based on separating the text layer from the graphics one on the initial image and compression of each layer by the most suitable common algorithm. Then compressed layers are placed into PDF/A, a standardizated file format for long-term archiving of electronic documents. Using the individual separation algorithm for each type of document makes it possible to save the image to the best advantage. Moreover, the text layer can be processed by an OCR system and the recognized text can also be placed into the same PDF/A file for making it easy to perform cut and paste and text search operations.

Full text: