Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11889/2373
Title: Building a corpus for Palestinian Arabic : a preliminary study
Authors: Jarrar, Mustafa
Habash, Nizar
Akra, Diyam Fuad
Zalmout, Nasser
Issue Date: 2014
Publisher: ACM
Abstract: This paper presents preliminary results in building an annotated corpus of the Palestinian Arabic dialect. The corpus consists of about 43K words, stemming from diverse resources. The paper discusses some linguistic facts about the Palestinian dialect, compared with the Modern Standard Arabic, especially in terms of morphological, orthographic, and lexical variations, and suggests some directions to resolve the challenges these differences pose to the annotation goal. Furthermore, we present two pilot studies that investigate whether existing tools for processing Modern Standard Arabic and Egyptian Arabic can be used to speed up the annotation process of our Palestinian Arabic corpus
URI: http://hdl.handle.net/20.500.11889/2373
Appears in Collections:Fulltext Publications

Files in This Item:
File Description SizeFormat
11331.pdf1.36 MBAdobe PDFView/Open
Show full item record

Page view(s)

40
Last Week
2
Last month
13
checked on Feb 25, 2020

Download(s)

33
checked on Feb 25, 2020

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.