An Enhanced Text-Classification-Based Arabic Information Retrieval System

Sameh Ghwanmeh, Ghassan Kanaan, Riyad Al-Shalabi, Ahmad Ababneh

Research output: Chapter in Book/Report/Conference proceedingChapter

1 Citation (Scopus)

Abstract

This chapter presents enhanced, effective and simple approach to text classification. The approach uses an algorithm to automatically classifying documents. The main idea of the algorithm is to select feature words from each document; those words cover all the ideas in the document. The results of this algorithm are list of the main subjects founded in the document. Also, in this chapter the effects of the Arabic text classification on Information Retrieval have been investigated. The goal was to improve the convenience and effectiveness of information access. The system evaluation was conducted in two cases based on precision/recall criteria: evaluate the system without using Arabic text classification and evaluate the system with Arabic text classification. A chain of experiments were carried out to test the algorithm using 242 Arabic abstracts From the Saudi Arabian National Computer Conference. Additionally, automatic phrase indexing was implemented. Experiments revealed that the system with text classification gives better performance than the system without text classification.

Original languageEnglish
Title of host publicationUtilizing Information Technology Systems Across Disciplines
Subtitle of host publicationAdvancements in the Application of Computer Science
EditorsEvon M. O. Abu-Taieh, Asim A. El-Sheikh, Jeihan Abu-Tayeh
PublisherIGI Global
Chapter2
Pages37-44
Number of pages8
ISBN (Electronic)9781605666174
ISBN (Print)9781605666167, 1605666165, 9781616925390
DOIs
Publication statusPublished - 2009
Externally publishedYes

Fingerprint

Information retrieval systems
Automatic indexing
Information retrieval
Experiments

Cite this

Ghwanmeh, S., Kanaan, G., Al-Shalabi, R., & Ababneh, A. (2009). An Enhanced Text-Classification-Based Arabic Information Retrieval System. In E. M. O. Abu-Taieh, A. A. El-Sheikh, & J. Abu-Tayeh (Eds.), Utilizing Information Technology Systems Across Disciplines: Advancements in the Application of Computer Science (pp. 37-44). IGI Global. https://doi.org/10.4018/978-1-60566-616-7.ch002
Ghwanmeh, Sameh ; Kanaan, Ghassan ; Al-Shalabi, Riyad ; Ababneh, Ahmad. / An Enhanced Text-Classification-Based Arabic Information Retrieval System. Utilizing Information Technology Systems Across Disciplines: Advancements in the Application of Computer Science. editor / Evon M. O. Abu-Taieh ; Asim A. El-Sheikh ; Jeihan Abu-Tayeh. IGI Global, 2009. pp. 37-44
@inbook{064c436edc204cf88894759c5ec10dd8,
title = "An Enhanced Text-Classification-Based Arabic Information Retrieval System",
abstract = "This chapter presents enhanced, effective and simple approach to text classification. The approach uses an algorithm to automatically classifying documents. The main idea of the algorithm is to select feature words from each document; those words cover all the ideas in the document. The results of this algorithm are list of the main subjects founded in the document. Also, in this chapter the effects of the Arabic text classification on Information Retrieval have been investigated. The goal was to improve the convenience and effectiveness of information access. The system evaluation was conducted in two cases based on precision/recall criteria: evaluate the system without using Arabic text classification and evaluate the system with Arabic text classification. A chain of experiments were carried out to test the algorithm using 242 Arabic abstracts From the Saudi Arabian National Computer Conference. Additionally, automatic phrase indexing was implemented. Experiments revealed that the system with text classification gives better performance than the system without text classification.",
author = "Sameh Ghwanmeh and Ghassan Kanaan and Riyad Al-Shalabi and Ahmad Ababneh",
year = "2009",
doi = "10.4018/978-1-60566-616-7.ch002",
language = "English",
isbn = "9781605666167",
pages = "37--44",
editor = "Abu-Taieh, {Evon M. O.} and El-Sheikh, {Asim A.} and Jeihan Abu-Tayeh",
booktitle = "Utilizing Information Technology Systems Across Disciplines",
publisher = "IGI Global",

}

Ghwanmeh, S, Kanaan, G, Al-Shalabi, R & Ababneh, A 2009, An Enhanced Text-Classification-Based Arabic Information Retrieval System. in EMO Abu-Taieh, AA El-Sheikh & J Abu-Tayeh (eds), Utilizing Information Technology Systems Across Disciplines: Advancements in the Application of Computer Science. IGI Global, pp. 37-44. https://doi.org/10.4018/978-1-60566-616-7.ch002

An Enhanced Text-Classification-Based Arabic Information Retrieval System. / Ghwanmeh, Sameh; Kanaan, Ghassan; Al-Shalabi, Riyad; Ababneh, Ahmad.

Utilizing Information Technology Systems Across Disciplines: Advancements in the Application of Computer Science. ed. / Evon M. O. Abu-Taieh; Asim A. El-Sheikh; Jeihan Abu-Tayeh. IGI Global, 2009. p. 37-44.

Research output: Chapter in Book/Report/Conference proceedingChapter

TY - CHAP

T1 - An Enhanced Text-Classification-Based Arabic Information Retrieval System

AU - Ghwanmeh, Sameh

AU - Kanaan, Ghassan

AU - Al-Shalabi, Riyad

AU - Ababneh, Ahmad

PY - 2009

Y1 - 2009

N2 - This chapter presents enhanced, effective and simple approach to text classification. The approach uses an algorithm to automatically classifying documents. The main idea of the algorithm is to select feature words from each document; those words cover all the ideas in the document. The results of this algorithm are list of the main subjects founded in the document. Also, in this chapter the effects of the Arabic text classification on Information Retrieval have been investigated. The goal was to improve the convenience and effectiveness of information access. The system evaluation was conducted in two cases based on precision/recall criteria: evaluate the system without using Arabic text classification and evaluate the system with Arabic text classification. A chain of experiments were carried out to test the algorithm using 242 Arabic abstracts From the Saudi Arabian National Computer Conference. Additionally, automatic phrase indexing was implemented. Experiments revealed that the system with text classification gives better performance than the system without text classification.

AB - This chapter presents enhanced, effective and simple approach to text classification. The approach uses an algorithm to automatically classifying documents. The main idea of the algorithm is to select feature words from each document; those words cover all the ideas in the document. The results of this algorithm are list of the main subjects founded in the document. Also, in this chapter the effects of the Arabic text classification on Information Retrieval have been investigated. The goal was to improve the convenience and effectiveness of information access. The system evaluation was conducted in two cases based on precision/recall criteria: evaluate the system without using Arabic text classification and evaluate the system with Arabic text classification. A chain of experiments were carried out to test the algorithm using 242 Arabic abstracts From the Saudi Arabian National Computer Conference. Additionally, automatic phrase indexing was implemented. Experiments revealed that the system with text classification gives better performance than the system without text classification.

UR - http://www.scopus.com/inward/record.url?scp=84900660537&partnerID=8YFLogxK

U2 - 10.4018/978-1-60566-616-7.ch002

DO - 10.4018/978-1-60566-616-7.ch002

M3 - Chapter

SN - 9781605666167

SN - 1605666165

SN - 9781616925390

SP - 37

EP - 44

BT - Utilizing Information Technology Systems Across Disciplines

A2 - Abu-Taieh, Evon M. O.

A2 - El-Sheikh, Asim A.

A2 - Abu-Tayeh, Jeihan

PB - IGI Global

ER -

Ghwanmeh S, Kanaan G, Al-Shalabi R, Ababneh A. An Enhanced Text-Classification-Based Arabic Information Retrieval System. In Abu-Taieh EMO, El-Sheikh AA, Abu-Tayeh J, editors, Utilizing Information Technology Systems Across Disciplines: Advancements in the Application of Computer Science. IGI Global. 2009. p. 37-44 https://doi.org/10.4018/978-1-60566-616-7.ch002