Please use this identifier to cite or link to this item:
Title: Statistical/corpus based methods for improved bilingual (Arabic/English) web search
Authors: Yahya, Adnan
Hithnawi, Anwar
Salhi, Ali
Fawadleh, Merna
Keywords: Web search engines;Search engines - Arabic countries
Issue Date: 3-Jun-2008
Abstract: With the vast growth of the Arabic text documents stored on Web and data warehouses on the Internet, the need for powerful natural language processing methods to help people finding and managing Arabic information in an efficient is becoming more acute. Through this research we are targeting the design and implementation of text mining tools that are able to efficiently index, process, search, and categorize Arabic data in large quantities in preparation for future searches. For that we needed to deal with the challenges Arabic introduces for natural language processing and information retrieval, automatic Arabic document categorization, and Arabic query correction/suggestion and expansion. We focus on the issue of how to employ statistical/corpus based natural language processing methods to guarantee the best way of managing and finding Arabic information and the development of the corpus.
Appears in Collections:Fulltext Publications

Files in This Item:
File Description SizeFormat
HumboldtKellogAbstract.pdf286.33 kBAdobe PDFView/Open
Show full item record

Page view(s)

Last Week
Last month
checked on Dec 4, 2021


checked on Dec 4, 2021

Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.