Statistical/corpus based methods for improved bilingual (Arabic/English) web search

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11889/4393

Title:	Statistical/corpus based methods for improved bilingual (Arabic/English) web search
Authors:	Yahya, Adnan Hithnawi, Anwar Salhi, Ali Fawadleh, Merna
Keywords:	Web search engines;Search engines - Arabic countries
Issue Date:	3-Jun-2008
Abstract:	With the vast growth of the Arabic text documents stored on Web and data warehouses on the Internet, the need for powerful natural language processing methods to help people finding and managing Arabic information in an efficient is becoming more acute. Through this research we are targeting the design and implementation of text mining tools that are able to efficiently index, process, search, and categorize Arabic data in large quantities in preparation for future searches. For that we needed to deal with the challenges Arabic introduces for natural language processing and information retrieval, automatic Arabic document categorization, and Arabic query correction/suggestion and expansion. We focus on the issue of how to employ statistical/corpus based natural language processing methods to guarantee the best way of managing and finding Arabic information and the development of the corpus.
URI:	http://hdl.handle.net/20.500.11889/4393
Appears in Collections:	Fulltext Publications

File	Description	Size	Format
HumboldtKellogAbstract.pdf		286.33 kB	Adobe PDF	View/Open

328

Last Week
0

Last month
2

checked on Apr 14, 2024

84

checked on Apr 14, 2024

Check