Please use this identifier to cite or link to this item:
DC FieldValueLanguage
dc.contributor.authorAbuhammad, Yaraen_US
dc.contributor.authorAddabe, Yaraen_US
dc.contributor.authorAyyad, Natalyen_US
dc.contributor.authorYahya, Adnanen_US
dc.identifier.citationAbuhammad Y., Aldabe Y., Ayyad N. and Yahya, A. "Authorship Attribution of Modern Standard Arabic Short Texts". Proceedings of the Seventh International Conference on Arab Women in Computing ArabWIC21. Sharjah. United Arab Emirates. 25 August 2021.ACM proceedings.en_US
dc.descriptionArabWIC 2021: ArabWIC 7th International Conference on Arab Women in Computing In Conjunction with 2nd Forum for Women in Research As Part of the ArabWIC 2021 Conference Series August 25, 2021en_US
dc.description.abstractText data, including short texts, constitute a major share of web content. The availability of this data to billions of users triggers frequent plagiarism attacks. Authorship Attribution (AA) seeks to identify the most probable author of a given text based on similarity to the writing style of potential authors. In this paper, we approach AA as a writing style profile generation process, where we group text instances for each author into a single profile. We use Twitter as the source for our short Modern Standard Arabic (MSA) texts. Numerous experiments with various training approaches, tools and features allowed us to settle on a text representation method that relies on text concatenation of Arabic tweets to form chunks, which are then duplicated to reach a precalculated length. These chunks are used to train machine learning models for our 45 author profiles. This allowed us to achieve accuracies up to 99%, which compares favorably with the best results reported in the literature.en_US
dc.publisherAssociation for Computing Machinery-ACMen_US
dc.subjectArabic authorship attributionen_US
dc.subjectSupport vector classificationen_US
dc.subjectMachine learning - Techniqueen_US
dc.subjectSocial mediaen_US
dc.subjectAuthorship attribution datasetsen_US
dc.subjectText processing (Computer science), Arabicen_US
dc.subjectNatural language processing (Computer science) - Arab countriesen_US
dc.subjectTweets, Arabicen_US
dc.subjectComputational linguistics, Arabicen_US
dc.titleAuthorship Attribution of Modern Standard Arabic Short Textsen_US
dcterms.creatorAdnan Yahyaen_US
newfileds.departmentEngineering and Technologyen_US
newfileds.general-subjectComputers and Information Technology | الحاسوب وتكنولوجيا المعلوماتen_US
dc.relation.conferenceArabWIC 2021: ArabWIC 7th International Conference on Arab Women in Computing.en_US
item.fulltextWith Fulltext-
Appears in Collections:Fulltext Publications
Files in This Item:
File Description SizeFormat
AA_PAPER___ACM.pdfpdf submitted file484.16 kBAdobe PDFView/Open
Show simple item record

Page view(s)

checked on May 11, 2022


checked on May 11, 2022

Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.