Adaptive Histogram Analysis for Scene Text Binarization and Recognition
Main Article Content
Abstract
Scene text binarization and recognition is a challenging task due to different appearance of text in clutter background and uneven illumination in natural scene images. In this paper, we present a new method based on adaptive histogram analysis for each sliding window over a word of a text line detected by the text detection method. The histogram analysis works on the basis that intensity values of text pixels in each sliding window have uniform color. The method segments the words based on region growing which studies spacing between words and characters. Then we propose to use existing OCRs such as ABBYY and Tesseract (Google) to recognize the text line at word and character levels to validate the binarization results. The method is compared with well-known global thresholding technique of binarization to show its effectiveness.