Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.11889/4485
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Yahya, Adnan | |
dc.contributor.author | Salhi, Ali | |
dc.date.accessioned | 2017-03-11T06:47:11Z | |
dc.date.available | 2017-03-11T06:47:11Z | |
dc.date.issued | 2013-05 | |
dc.identifier.citation | 2. Yahya, A. and A. Salhi. "Arabic Text Correction Using Dynamic Categorized Dictionaries: A Statistical Approach" ; Linguistica Communicatio Journal (Selected Papers from CITALA 2012);, Volume 5, 2013. | en_US |
dc.identifier.uri | http://hdl.handle.net/20.500.11889/4485 | |
dc.description | Selected for Journal Publication from CITALA12 Conference. | en_US |
dc.description.abstract | This paper describes a technique for spelling and correcting Arabic text that provides different variables that can be controlled to give customized results based on the properties of the processed text. The proposed technique depends on dynamic dictionaries controlled and customized based on the input text categorization. In the research reported here we employ a statistical/corpus-based approach with data obtained from the Arabic Wikipedia and local Palestinian newspapers. Based on corpus statistics we constructed databases of words and their frequencies as single, double and triple expressions and used that as the infrastructure for our spelling and text correction technique. Our spelling technique builds on earlier work[7], but using new spelling variables and dynamic dictionaries based on categorized texts. We briefly report on the results of preliminary testing and analysis. While the results reported here are promising, they must be viewed as work in progress, still in need of more testing, refining, integration and deployment in real life settings. | en_US |
dc.language.iso | en | en_US |
dc.subject | Natural language processing (Computer science) | en_US |
dc.subject | Wikipedia | en_US |
dc.subject | Editing | en_US |
dc.subject | Language and languages - Computer-assisted instruction | en_US |
dc.subject | Data mining | en_US |
dc.title | Arabic text correction using dynamic categorized dictionaries: a statistical approach | en_US |
dc.type | Article | en_US |
newfileds.department | Engineering and TechnologyEngineering and Technology | en_US |
newfileds.conference | International journal of Computational and General Linguistics: LINGUISTICA COMMUNICATIO, Vol. 5, 2013. | en_US |
newfileds.item-access-type | bzu | en_US |
newfileds.thesis-prog | none | en_US |
newfileds.general-subject | Computers and Information Technology | الحاسوب وتكنولوجيا المعلومات | en_US |
item.languageiso639-1 | other | - |
item.fulltext | With Fulltext | - |
item.grantfulltext | open | - |
Appears in Collections: | Fulltext Publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
YahyaSalhiLinguisticaComPaper.pdf | 601.84 kB | Adobe PDF | View/Open |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.