Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11889/4390
Title: Quality assessment of Arabic web content: the case of Arabic Wikipedia
Authors: Yahya, Adnan
Salhi, Ali
Keywords: Document Quality Assessment
Wikipedia, Arabic
Web sites - Design - Quality
Issue Date: Nov-2014
Abstract: With the huge size and large diversity of Arabic web content, machine assessment of document quality acquires added importance. Users are in dire need for quality rating of the material returned in response to their queries. The Wikipedia, with its large metadata, has been a topic of extensive research on document quality assessment. Criteria used include text properties and style parameters, contributor and edit characteristics and multimedia components. In this paper we report on our ongoing work to adapt existing document assessment approaches to Arabic content with concentration on the Arabic Wikipedia and present some of the results. We also try to augment that with features specific to Arabic as well as parameters like author expertise and social media presence. One of our goals is an aggregate measure integrating many of the features into a single document quality index. We plan to use Wikipedia article quality assessment results to train general content assessment methods that can be applied to general content that lacks major Wikipedia features.
URI: http://hdl.handle.net/20.500.11889/4390
Appears in Collections:Fulltext Publications

Files in This Item:
File Description SizeFormat 
YahyaSalhiInnvations2014Paper.pdf779.81 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.