Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11889/5536
DC FieldValueLanguage
dc.contributor.authorOdeh, Feras-
dc.date.accessioned2018-05-07T09:44:58Z-
dc.date.available2018-05-07T09:44:58Z-
dc.date.issued2018-
dc.identifier.urihttp://hdl.handle.net/20.500.11889/5536-
dc.description.abstractThe recent popularity of the social media networks including forums, blogs, and micro-blogging networks changed the way patients share their health experiences and treatment options. Such forums offer valuable, unsolicited, uncensored information on drug safety and side effects directly from patients. However, it is very challenging to extract useful information from such forums due to several factors such as grammatical and spelling errors, colloquial language, and post length limitation. Furthermore, due to the sensitivity of the domain for adverse drug reactions (ADR) detection, it is more critical to identify correct ADRs (i.e., achieve higher classification precision) than identifying non-precise ones. The aims of this thesis are: (i) to develop a new approach for ADR classification in twitter posts called Semantic Vector(SemVec); (ii) to explore natural language processing (NLP) approaches for generating domain features from text, and utilizing them for ADRs detection; and (iii) to improve convolution neural network (CNN) ADR classifi- cation precision by incorporating domain features. This thesis proposes a dynamic and pluggable model, named SemVec, for representing words as a vector of both domain and morphological features. Based on the problem domain, domain features can be added or removed to generate an enriched word representation with domain knowledge. SemVec represents each post as a matrix of word vectors, which is fed into CNN. SemVec is scalable, can be applied to other domains by employing relevant natural language processing methods and domain lexicons. The proposed method was evaluated on Twitter (ADR) dataset. Results show that SemVec improves the precision of ADR detection by 13.43% over other state-of-the-art deep learning methods with a comparable recall score.en_US
dc.language.isoen_USen_US
dc.subjectMedicine - Data processingen_US
dc.subjectSocial media in medicineen_US
dc.subjectDrugs - Side effectsen_US
dc.subjectOnline social networks - Safety measuresen_US
dc.subjectInformation storage and retrieval systems - Medical careen_US
dc.subjectSemantic weben_US
dc.titleA domain-based feature generation and convolution neural network approach for extracting adverse drug reactions from social media postsen_US
dc.typeThesisen_US
newfileds.departmentGraduate Studiesen_US
newfileds.item-access-typeopen_accessen_US
newfileds.thesis-progScientific Computationen_US
newfileds.general-subjectnoneen_US
item.grantfulltextopen-
item.fulltextWith Fulltext-
item.languageiso639-1other-
Appears in Collections:Theses
Files in This Item:
File Description SizeFormat
Thesis(1).pdf2.11 MBAdobe PDFView/Open
Show simple item record

Page view(s)

142
Last Week
0
Last month
2
checked on Apr 14, 2024

Download(s)

102
checked on Apr 14, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.