Skew Detection and Correction Technique for Arabic Document Images Based on Centre of Gravity
Abstract
Problem statement: Skew detection and correction is the first step process in the document analysis and understanding processing steps. Correction the skewed scanned document image is very important, because it has a direct effect on the reliability and efficiency of the segmentation and feature extraction stages. The noises and the deviation in the document resolution or types are still the main two challenges facing the Arabic skew detection and correction methods. Approach: The proposed method work involved inscribing the text in the document by an arbitrary polygon and derivation of the baseline from polygon’s centroid. Results: The proposed method was implemented on 150 different scanned Arabic documents, from different sources like journals, textbooks, newspapers and the like in addition to handwritten document, with different resolutions and different fonts and it was obtained an accuracy ratio of 87%. Conclusion: The proposed method was efficient, simple and fast, it was not affected by noise and it was proved their suitability to work with documents with different fonts and documents with different resolutions.
DOI: https://doi.org/10.3844/jcssp.2009.363.368
Copyright: © 2009 Atallah Mahmoud Al-Shatnawi and Khairuddin Omar. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 3,879 Views
- 3,351 Downloads
- 24 Citations
Download
Keywords
- Arabic document
- skew detection
- skew correction
- centre of gravity