A Robust Document Processing System Combining Image Segmentation with Content-based Document Compression

Yibing Yang*, Hong Yan

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

3 Citations (Scopus)

Abstract

A document processing system combining image segmentation with content-based document compression is proposed in the paper. Firstly, a grayscale document image is divided into small blocks and analysed. Then, a modified logical thresholding method based on, local structure analysis and the adaptive logical level technique is used to transform the grayscale document into a binary image. We extract all patterns from the binary document and use a multistage matching method to extract representative patterns. A decomposition method is used to deal with relatively large patterns. Finally, high ratio compression is achieved by coding the relative positions of symbols, extracted representative patterns and other decomposed patterns using the adaptive arithmetic coder anal Q-Coder respectively. © 2000 IEEE.
Original languageEnglish
Title of host publicationProceedings 15th International Conference on Pattern Recognition, ICPR 2000
EditorsA. Sanfeliu, J. J. Villanueva , M. Vanrell, R. Alqukzar, J. Crowley , Y. Shirai
PublisherIEEE
Pages519-522
Volume4
ISBN (Print)0769507506
DOIs
Publication statusPublished - Sept 2000
Event15th International Conference on Pattern Recognition, ICPR 2000 - Barcelona, Spain
Duration: 3 Sept 20007 Sept 2000

Publication series

NameInternational Conference on Pattern Recognition
ISSN (Print)1051-4651

Conference

Conference15th International Conference on Pattern Recognition, ICPR 2000
PlaceSpain
CityBarcelona
Period3/09/007/09/00

Fingerprint

Dive into the research topics of 'A Robust Document Processing System Combining Image Segmentation with Content-based Document Compression'. Together they form a unique fingerprint.

Cite this