Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11889/2373
Title: Building a corpus for Palestinian Arabic : a preliminary study
Authors: Jarrar, Mustafa
Habash, Nizar
Akra, Diyam
Zalmout, Nasser
Issue Date: 2014
Publisher: ACM
Abstract: This paper presents preliminary results in building an annotated corpus of the Palestinian Arabic dialect. The corpus consists of about 43K words, stemming from diverse resources. The paper discusses some linguistic facts about the Palestinian dialect, compared with the Modern Standard Arabic, especially in terms of morphological, orthographic, and lexical variations, and suggests some directions to resolve the challenges these differences pose to the annotation goal. Furthermore, we present two pilot studies that investigate whether existing tools for processing Modern Standard Arabic and Egyptian Arabic can be used to speed up the annotation process of our Palestinian Arabic corpus
URI: http://hdl.handle.net/20.500.11889/2373
Appears in Collections:Fulltext Publications

Files in This Item:
File Description SizeFormat 
11331.pdf1.36 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.