|
Digital Library of the
European Council for Modelling and Simulation |
Title: |
Structural
Compression Of Document Images With PDF/A |
Authors: |
Sergey Usilin, Dmitry Nikolaev, Vassili Postnikov |
Published in: |
(2010).ECMS
2010 Proceedings edited by A Bargiela S A Ali D
Crowley E J H Kerckhoffs. European Council for
Modeling and Simulation. doi:10.7148/2010 ISBN:
978-0-9564944-1-2 24th
European Conference on Modelling and Simulation, Simulation Meets Global Challenges Kuala
Lumpur, June 1-4 2010 |
Citation
format: |
Usilin, S. A., Nikolaev,
D. P., & Postnikov, V. V. (2010). Structural
Compression Of Document Images With PDF/A. ECMS 2010 Proceedings edited by A Bargiela S A Ali D Crowley E J H Kerckhoffs
(pp. 242-246). European Council for Modeling and Simulation. doi:10.7148/2010-0242-0246 |
DOI: |
http://dx.doi.org/10.7148/2010-0242-0246 |
Abstract: |
This
paper describes a new compression algorithm of document images based on
separating the text layer from the graphics one on the initial image and
compression of each layer by the most suitable common algorithm. Then
compressed layers are placed into PDF/A, a standardizated
file format for long-term archiving of electronic documents. Using the
individual separation algorithm for each type of document makes it possible
to save the image to the best advantage. Moreover, the text layer can be
processed by an OCR system and the recognized text can also be placed into
the same PDF/A file for making it easy to perform cut and paste and text
search operations. |
Full
text: |