Abstract
This chapter presents an enhanced, effective, and simple approach to text classification. The approach uses an algorithm to classify documents automatically. The main idea of the algorithm is to select feature words from each document such that those words cover all the ideas in the document. The result of the algorithm is a list of the main subjects found in the document. The chapter also investigates the effects of Arabic text classification on information retrieval, with the goal of improving the convenience and effectiveness of information access. The system was evaluated on precision/recall criteria in two cases: without Arabic text classification and with Arabic text classification. A series of experiments was carried out to test the algorithm using 242 Arabic abstracts from the Saudi Arabian National Computer Conference. Additionally, automatic phrase indexing was implemented. The experiments showed that the system with text classification performs better than the system without it.
Original language | English |
---|---|
Title of host publication | Utilizing Information Technology Systems Across Disciplines |
Subtitle of host publication | Advancements in the Application of Computer Science |
Editors | Evon M. O. Abu-Taieh, Asim A. El-Sheikh, Jeihan Abu-Tayeh |
Publisher | IGI Global |
Chapter | 2 |
Pages | 37-44 |
Number of pages | 8 |
ISBN (Electronic) | 9781605666174 |
ISBN (Print) | 9781605666167, 1605666165, 9781616925390 |
Publication status | Published - 2009 |
Externally published | Yes |