Title: Building a corpus for Palestinian Arabic : a preliminary study
Authors: Jarrar, Mustafa
Habash, Nizar
Akra, Diyam Fuad
Zalmout, Nasser
Issue Date: 2014
Publisher: ACM
Abstract: This paper presents preliminary results in building an annotated corpus of the Palestinian Arabic dialect. The corpus consists of about 43K words, stemming from diverse resources. The paper discusses some linguistic facts about the Palestinian dialect, compared with Modern Standard Arabic, especially in terms of morphological, orthographic, and lexical variations, and suggests some directions to resolve the challenges these differences pose to the annotation goal. Furthermore, we present two pilot studies that investigate whether existing tools for processing Modern Standard Arabic and Egyptian Arabic can be used to speed up the annotation process of our Palestinian Arabic corpus.
Appears in Collections: Fulltext Publications

Files in This Item:
File: 11331.pdf
Size: 1.36 MB
Format: Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.