Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.11889/4925
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Hashesh, Ala' | |
dc.contributor.author | Alkhamra, Othman | |
dc.contributor.author | Salameh, Ahmad | |
dc.contributor.author | Sayyad, Abdel Salam | |
dc.date.accessioned | 2017-05-02T11:11:20Z | |
dc.date.available | 2017-05-02T11:11:20Z | |
dc.date.issued | 2017-04 | |
dc.identifier.uri | http://hdl.handle.net/20.500.11889/4925 | |
dc.description.abstract | Data is everywhere, but to extract specific information from huge data could be an exhausting process. However, there are many concepts introduced in computer science can be used to make this problem simpler, such as regular expressions. But, generating a regular expression capable of extracting a predefined string from a text is not an everyday task. In this research, Regular Expression are generated using Genetic Programming. The validity and correctness of a regular expression is decided by making it extract a set of positive examples and ignore another set of negative examples. We validate this method with three datasets related to IPv4 address extraction, article title extraction, and HTML Header extraction. The resulting regular expressions achieved very good accuracy of extraction for the given tasks. | en_US |
dc.language.iso | en | en_US |
dc.subject | Genetic programming (Computer science) | en_US |
dc.subject | Text processing (Computer science) | en_US |
dc.subject | Programming languages (Electronic computers) | en_US |
dc.subject | Electronic data processing | en_US |
dc.subject | Text editors (Computer programs) | en_US |
dc.title | Intelligent data extraction system using regular expressions | en_US |
dc.type | Conference Proceedings | en_US |
newfileds.department | Engineering and Technology | en_US |
newfileds.conference | New trends in information technology (2017 : Amman, Jordan) | en_US |
newfileds.item-access-type | open_access | en_US |
newfileds.thesis-prog | none | en_US |
newfileds.general-subject | Computers and Information Technology | الحاسوب وتكنولوجيا المعلومات | en_US |
item.languageiso639-1 | other | - |
item.fulltext | With Fulltext | - |
item.grantfulltext | open | - |
Appears in Collections: | Fulltext Publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
revised_regex_paper.pdf | 853.76 kB | Adobe PDF | View/Open |
Page view(s)
236
Last Week
0
0
Last month
7
7
checked on Mar 25, 2024
Download(s)
123
checked on Mar 25, 2024
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.