Document Layout Analysis Based on Fuzzy Energy Matrix Oh, KangHan; Kim, SooHyung;
In this paper, we describe a novel method for document layout analysis that is based on a Fuzzy Energy Matrix (FEM). A FEM is a two-dimensional matrix that contains the likelihood of text and non-text and is generated through the use of Fuzzy theory. The key idea is to define an Energy map for the document to categorize text and non-text. The proposed mechanism is designed for execution with a low-resolution document image, and hence our method has a fast processing speed. The proposed method has been tested on public ICDAR 2009 datasets to conduct a comparison against other state-of-the-art methods, and it was also tested with Korean documents. The results of the experiment indicate that this scheme achieves superior segmentation accuracy, in terms of both precision and recall, and also requires less time for computation than other state-of-the-art document image analysis methods.
Fuzzy Energy Matrix;Document Layout Segmentation;Fuzzy Set;
P. Bhupendra Kumar and S. Sanjay, “Integrated Fuzzy-HMM for project uncertainties in time-cost tradeoff problem,” J. Applied Soft Computing, vol. 21, 2014, pp. 320-329.
S. M. Chen, “Forecasting enrollments based on fuzzy time series,” Fuzzy Sets and Systems, vol. 81, 1996, pp. 311-319.
N. Otsu, “A Threshold Selection method from Gray Level Histogram,” IEEE Transactions on Systems, Man and Cybernetics, vol. 9, no. 1, 1975, pp. 62-66.
J. Sauvola and M. Pietikainen, “Adaptive document image binarization,” J. The Journal of the Pattern Recognition, 2000, pp. 225-236
A. Fletcher and R. Kasturi, “A robust algorithm for text string separation from mixed text/graphics images,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 10, 1998, pp. 294-308.
K. J. Anil and Y. Bin, “Document representation and its application to page decomposition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 20, 1998, pp. 294-308.
D. L. O’Gorman, “The document spectrum for page layout analysis,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 15, 1993, pp. 1162-1173.
C. Laura, C. Ciro, and G. Przemyslaw, “Document page segmentation using neuro-fuzzy approach,” Applied Soft Computing, vol. 8, 2008, pp. 118-126.
K. Kise, A. Sato, and M. Iwata, “Segmentation of page images using the area Voronoi diagram,” In Computer Vision Image Understanding, vol. 70, no. 3, 1998, pp. 370-382.
D. H. S. Baird, “Backgroud structure in document images,” Int. J. Pattern Recognition Artif. Intell., vol. 8, 1994, pp. 1013-1030.
K. J. Anil and Y. Bin, “Document representation and its application to pagedecomposition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 20, 1998, pp. 294-308.
A. Antonacopoulos, S. Pletschacher, D. Bridson, and C. Papadopoulos, “ICDAR2009 Page Segmentation Competition”, Proc. ICDAR, 2009, pp. 1370-1374.
Ray Smith, “Hybrid page Layout Analysis via Tab-Stop Detection”, Proc. ICDAR, Barcelona, Spain, 2009, pp. 241-245.