Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11889/4925
DC FieldValueLanguage
dc.contributor.authorHashesh, Ala'
dc.contributor.authorAlkhamra, Othman
dc.contributor.authorSalameh, Ahmad
dc.contributor.authorSayyad, Abdel Salam
dc.date.accessioned2017-05-02T11:11:20Z
dc.date.available2017-05-02T11:11:20Z
dc.date.issued2017-04
dc.identifier.urihttp://hdl.handle.net/20.500.11889/4925
dc.description.abstractData is everywhere, but to extract specific information from huge data could be an exhausting process. However, there are many concepts introduced in computer science can be used to make this problem simpler, such as regular expressions. But, generating a regular expression capable of extracting a predefined string from a text is not an everyday task. In this research, Regular Expression are generated using Genetic Programming. The validity and correctness of a regular expression is decided by making it extract a set of positive examples and ignore another set of negative examples. We validate this method with three datasets related to IPv4 address extraction, article title extraction, and HTML Header extraction. The resulting regular expressions achieved very good accuracy of extraction for the given tasks.en_US
dc.language.isoenen_US
dc.subjectGenetic programming (Computer science)en_US
dc.subjectText processing (Computer science)en_US
dc.subjectProgramming languages (Electronic computers)en_US
dc.subjectElectronic data processingen_US
dc.subjectText editors (Computer programs)en_US
dc.titleIntelligent data extraction system using regular expressionsen_US
dc.typeConference Proceedingsen_US
newfileds.departmentEngineering and Technologyen_US
newfileds.conferenceNew trends in information technology (2017 : Amman, Jordan)en_US
newfileds.item-access-typeopen_accessen_US
newfileds.thesis-prognoneen_US
newfileds.general-subjectComputers and Information Technology | الحاسوب وتكنولوجيا المعلوماتen_US
item.grantfulltextopen-
item.fulltextWith Fulltext-
item.languageiso639-1other-
Appears in Collections:Fulltext Publications
Files in This Item:
File Description SizeFormat
revised_regex_paper.pdf853.76 kBAdobe PDFView/Open
Show simple item record

Page view(s)

238
Last Week
0
Last month
7
checked on Apr 14, 2024

Download(s)

123
checked on Apr 14, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.