Please use this identifier to cite or link to this item:
Title: A Robust Line Segmentation Algorithm for Arabic Printed Text with Diacritics
Authors: Ayesh, Muna 
Mohammad, Khader 
Qaroush, Aziz 
Agaian, Sos 
Washha, Mahdi 
Keywords: Optical character recognition;Arabic language—Diacritics;Character sets (Data processing)
Issue Date: 2017
Abstract: Line segmentation performs a significant stage in the OCR systems; it has a direct effect on the character segmentation stage which affects the recognition rate. In this paper a robust algorithm is proposed for line segmentation for Arabic printed text system with and without diacritics based on finding the global maximum peak and the baseline detection. The algorithm is tested for different font sizes and types and results have been obtained from testing 5 types of fonts with total of 43,055 lines with 99.9 % accuracy for text without diacritics and 99.5% accuracy for text with diacritics.
Appears in Collections:Fulltext Publications

Files in This Item:
File Description SizeFormat
A robust line segmentation algorithm for Arabic printed text with diacritics.pdf2.98 MBAdobe PDFView/Open
Show full item record

Page view(s)

checked on Jun 18, 2024


checked on Jun 18, 2024

Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.