Skip to main navigation Skip to search Skip to main content

Newspaper layout analysis incorporating connected component separation

    Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

    Abstract

    This paper presents an algorithm that performs automated segmentation and classification of newspaper images. A notable feature of the algorithm is a technique for segmenting components that are connected to other components. In particular, horizontal lines and vertical lines, which can be vital in determining the page layout, can be segmented from other lines and other components. The algorithm uses a bottom-up approach to initially segment the image, classify patterns and extract text lines. The classified patterns are then merged into complete regions. The algorithm is tested on a set of complex English and Greek newspaper images dating back to 1900. © 2003 Published by Elsevier B.V.
    Original languageEnglish
    Pages (from-to)307-317
    JournalImage and Vision Computing
    Volume22
    Issue number4
    DOIs
    Publication statusPublished - 1 Apr 2004

    Research Keywords

    • Document analysis
    • Image segmentation
    • Newspaper segmentation

    Fingerprint

    Dive into the research topics of 'Newspaper layout analysis incorporating connected component separation'. Together they form a unique fingerprint.

    Cite this