Please use this identifier to cite or link to this item:
DC FieldValueLanguage
dc.contributor.authorYahya, Adnan
dc.contributor.authorSalhi, Ali
dc.identifier.citation2. Yahya, A. and A. Salhi. "Arabic Text Correction Using Dynamic Categorized Dictionaries: A Statistical Approach" ; Linguistica Communicatio Journal (Selected Papers from CITALA 2012);, Volume 5, 2013.en_US
dc.descriptionSelected for Journal Publication from CITALA12 Conference.en_US
dc.description.abstractThis paper describes a technique for spelling and correcting Arabic text that provides different variables that can be controlled to give customized results based on the properties of the processed text. The proposed technique depends on dynamic dictionaries controlled and customized based on the input text categorization. In the research reported here we employ a statistical/corpus-based approach with data obtained from the Arabic Wikipedia and local Palestinian newspapers. Based on corpus statistics we constructed databases of words and their frequencies as single, double and triple expressions and used that as the infrastructure for our spelling and text correction technique. Our spelling technique builds on earlier work[7], but using new spelling variables and dynamic dictionaries based on categorized texts. We briefly report on the results of preliminary testing and analysis. While the results reported here are promising, they must be viewed as work in progress, still in need of more testing, refining, integration and deployment in real life settings.en_US
dc.subjectNatural language processing (Computer science)en_US
dc.subjectLanguage and languages - Computer-assisted instructionen_US
dc.subjectData miningen_US
dc.titleArabic text correction using dynamic categorized dictionaries: a statistical approachen_US
newfileds.departmentEngineering and TechnologyEngineering and Technologyen_US
newfileds.conferenceInternational journal of Computational and General Linguistics: LINGUISTICA COMMUNICATIO, Vol. 5, 2013.en_US
newfileds.general-subjectComputers and Information Technology | الحاسوب وتكنولوجيا المعلوماتen_US
item.fulltextWith Fulltext-
Appears in Collections:Fulltext Publications
Files in This Item:
File Description SizeFormat
YahyaSalhiLinguisticaComPaper.pdf601.84 kBAdobe PDFView/Open
Show simple item record

Page view(s)

Last Week
Last month
checked on Jan 2, 2022


checked on Jan 2, 2022

Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.