Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11889/4507
DC Field: Value [Language]
dc.contributor.author: Jarrar, Mustafa
dc.contributor.author: Habash, Nizar
dc.contributor.author: Alrimawi, Faeq
dc.contributor.author: Diyam, Akra
dc.contributor.author: Zalmout, Nasser
dc.date.accessioned: 2017-03-13T07:05:39Z
dc.date.available: 2017-03-13T07:05:39Z
dc.date.issued: 2016-12-08
dc.identifier.uri: http://hdl.handle.net/20.500.11889/4507
dc.description.abstract: In this article we present Curras, the first morphologically annotated corpus of the Palestinian Arabic dialect. Palestinian Arabic is one of the many primarily spoken dialects of the Arabic language. Arabic dialects are generally under-resourced compared to Modern Standard Arabic, the primarily written and official form of Arabic. We start in the article with a background description that situates Palestinian Arabic linguistically and historically and compares it to Modern Standard Arabic and Egyptian Arabic in terms of phonological, morphological, orthographic, and lexical variations. We then describe the methodology we developed to collect Palestinian Arabic text to guarantee a variety of representative domains and genres. We also discuss the annotation process we used, which extended previous efforts for annotation guideline development, and utilized existing automatic annotation solutions for Standard Arabic and Egyptian Arabic. The annotation guidelines and annotation meta-data are described in detail. The Curras Palestinian Arabic corpus consists of more than 56K tokens, which are annotated with rich morphological and lexical features. The inter-annotator agreement results indicate a high degree of consistency. [en_US]
dc.description.sponsorship: Curras project, funded by the Palestinian Ministry of Higher Education, Scientific Research Council [en_US]
dc.language.iso: en_US [en_US]
dc.publisher: Springer [en_US]
dc.relation.ispartofseries: Journal Language Resources and Evaluation; Pages (1-31), Volume (50), Issue (219)
dc.subject: Arabic language - Dialects - Palestine [en_US]
dc.subject: Inscriptions, Arabic - Palestine [en_US]
dc.subject: Arabic language - Morphology [en_US]
dc.subject: Arabic language - Orthography and spelling - Palestine [en_US]
dc.subject: Word annotation [en_US]
dc.title: Curras: an annotated corpus for the Palestinian Arabic dialect [en_US]
dc.type: Article [en_US]
newfileds.department: Engineering and Technology [en_US]
newfileds.item-access-type: open_access [en_US]
newfileds.thesis-prog: none [en_US]
newfileds.general-subject: Computers and Information Technology | الحاسوب وتكنولوجيا المعلومات [en_US]
item.grantfulltext: open [-]
item.fulltext: With Fulltext [-]
item.languageiso639-1: other [-]
Appears in Collections: Fulltext Publications
Files in This Item:
File | Description | Size | Format
JHRAZ17.pdf | | 1.23 MB | Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.