Hits:
Indexed by:会议论文
Date of Publication:2006-06-21
Included Journals:EI、CPCI-S、Scopus
Volume:2
Page Number:COVER3-COVER3
Key Words:local maximum component; component-labeling algorithm; parallelizable; layout analysis
Abstract:A definition of the local maximum component (the component for short) is presented for layout analysis in document image analysis (DIA), and a novel algorithm for component labeling was described. This algorithm uses a contour tracing technique to detect and label the external contour of each component, and removes the interior area of each component from the copy of the source image. Labeling and removing are completed in a single pass over source image. Experiments on various kinds of images (title, text, picture, table and formula) show that the new definition and algorithm are more efficient and flexible than the traditional labeling ones.