Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11889/8402
Title: Extractive multi-document Arabic text summarization using evolutionary multi-objective optimization with K-medoid clustering
Authors: Alqaisi, Rana 
Ghanem, Wasel 
Qaroush, Aziz 
Keywords: Extractive text summarization;Natural language processing (Computer science);Data mining;Big data;Multi-objective optimization;Less redundancy;Maximum coverage and relevancy;Extractive text summarization
Issue Date: 2020
Publisher: IEEE Access
Abstract: The increasing usage of the Internet and social networks has produced a significant amount of online textual data. These online textual data led to information overload and redundancy. It is important to eliminate the information redundancy and preserve the time required for reading these online textual data. Thus, there is a persistent need for an automatic text summarization system, which extract the relevant and salient information from a collection of documents, that sharing the same or related topics. Then, presenting this extracted information in a condensed form to preserve the main topics. This paper proposes an automatic, generic, and extractive Arabic multi-document summarization system. The proposed system employs the clustering-based and evolutionary multi-objective optimization methods. The clustering-based method discovers the main topics in the text, while the evolutionary multi-objective optimization method optimizes three objectives based on coverage, diversity/redundancy, and relevancy. The performance of the proposed system is evaluated using TAC 2011 and DUC 2002 datasets. The experimental results are compared using ROUGE evaluation measure. The obtained results showed the effectiveness of the proposed system compared to other peer systems. The proposed system outperformed other peer systems for all ROUGE metrics using TAC 2011. We achieved an F-measure of 38.9%, 17.7%, 35.4%, and 15.8% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively. In addition, the proposed system with DUC 2002 dataset achieved an F-measure of 47.1%, 23.7%, 47.1%, 20.4% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively.
URI: http://hdl.handle.net/20.500.11889/8402
DOI: 10.1109/ACCESS.2020.3046494
Appears in Collections:Fulltext Publications

Show full item record

Page view(s)

39
checked on Jun 18, 2024

Download(s)

10
checked on Jun 18, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.