Schema independent XML compressor

Baydaa Al-Hamadani, Zhongyu Lu, Raad F. Alwan

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

XML has become the standard way for representing and transforming data over the World Wide Web. The problem with XML documents is that they have a very high ratio of redundancy, which makes these documents demanding a large storage capacity and large network band-width for transmission. This study designs a system for compressing and querying XML documents (XMLCQ) which compresses the XML document without the need to its schema or DTD to minimize the amount of technologies associated with these documents. XMLCQ first compressed the XML document by separating its data into containers according to the path of these data from the root to the leaf, then it compressed these containers using a back-end compression technique. The compressed file then could be retrieved with any kind of queries applied. Only the required information is decompressed and submitted to the user. Depending on several experiments, the query processor part of the system showed the ability to answer different kinds of queries ranging from simple exact match queries to complex ones. Furthermore, this paper introduced the idea of retrieving information from more than one compressed XML documents.

Original languageEnglish
Title of host publicationInformation Retrieval Methods for Multidisciplinary Applications
PublisherIGI Global
Chapter7
Pages95-115
Number of pages21
ISBN (Electronic)9781466638990
ISBN (Print)1466638982, 9781466638983
DOIs
Publication statusPublished - 30 Apr 2013

Fingerprint

XML
Compressors
Containers
redundancy
Redundancy
Compaction
Bandwidth
Query
experiment
ability
Experiments
Container

Cite this

Al-Hamadani, B., Lu, Z., & Alwan, R. F. (2013). Schema independent XML compressor. In Information Retrieval Methods for Multidisciplinary Applications (pp. 95-115). IGI Global. https://doi.org/10.4018/978-1-4666-3898-3.ch007
Al-Hamadani, Baydaa ; Lu, Zhongyu ; Alwan, Raad F. / Schema independent XML compressor. Information Retrieval Methods for Multidisciplinary Applications. IGI Global, 2013. pp. 95-115
@inbook{8e09daf55a6e4090b1d9ce31619afdc3,
title = "Schema independent XML compressor",
abstract = "XML has become the standard way for representing and transforming data over the World Wide Web. The problem with XML documents is that they have a very high ratio of redundancy, which makes these documents demanding a large storage capacity and large network band-width for transmission. This study designs a system for compressing and querying XML documents (XMLCQ) which compresses the XML document without the need to its schema or DTD to minimize the amount of technologies associated with these documents. XMLCQ first compressed the XML document by separating its data into containers according to the path of these data from the root to the leaf, then it compressed these containers using a back-end compression technique. The compressed file then could be retrieved with any kind of queries applied. Only the required information is decompressed and submitted to the user. Depending on several experiments, the query processor part of the system showed the ability to answer different kinds of queries ranging from simple exact match queries to complex ones. Furthermore, this paper introduced the idea of retrieving information from more than one compressed XML documents.",
author = "Baydaa Al-Hamadani and Zhongyu Lu and Alwan, {Raad F.}",
year = "2013",
month = "4",
day = "30",
doi = "10.4018/978-1-4666-3898-3.ch007",
language = "English",
isbn = "1466638982",
pages = "95--115",
booktitle = "Information Retrieval Methods for Multidisciplinary Applications",
publisher = "IGI Global",

}

Al-Hamadani, B, Lu, Z & Alwan, RF 2013, Schema independent XML compressor. in Information Retrieval Methods for Multidisciplinary Applications. IGI Global, pp. 95-115. https://doi.org/10.4018/978-1-4666-3898-3.ch007

Schema independent XML compressor. / Al-Hamadani, Baydaa; Lu, Zhongyu; Alwan, Raad F.

Information Retrieval Methods for Multidisciplinary Applications. IGI Global, 2013. p. 95-115.

Research output: Chapter in Book/Report/Conference proceedingChapter

TY - CHAP

T1 - Schema independent XML compressor

AU - Al-Hamadani, Baydaa

AU - Lu, Zhongyu

AU - Alwan, Raad F.

PY - 2013/4/30

Y1 - 2013/4/30

N2 - XML has become the standard way for representing and transforming data over the World Wide Web. The problem with XML documents is that they have a very high ratio of redundancy, which makes these documents demanding a large storage capacity and large network band-width for transmission. This study designs a system for compressing and querying XML documents (XMLCQ) which compresses the XML document without the need to its schema or DTD to minimize the amount of technologies associated with these documents. XMLCQ first compressed the XML document by separating its data into containers according to the path of these data from the root to the leaf, then it compressed these containers using a back-end compression technique. The compressed file then could be retrieved with any kind of queries applied. Only the required information is decompressed and submitted to the user. Depending on several experiments, the query processor part of the system showed the ability to answer different kinds of queries ranging from simple exact match queries to complex ones. Furthermore, this paper introduced the idea of retrieving information from more than one compressed XML documents.

AB - XML has become the standard way for representing and transforming data over the World Wide Web. The problem with XML documents is that they have a very high ratio of redundancy, which makes these documents demanding a large storage capacity and large network band-width for transmission. This study designs a system for compressing and querying XML documents (XMLCQ) which compresses the XML document without the need to its schema or DTD to minimize the amount of technologies associated with these documents. XMLCQ first compressed the XML document by separating its data into containers according to the path of these data from the root to the leaf, then it compressed these containers using a back-end compression technique. The compressed file then could be retrieved with any kind of queries applied. Only the required information is decompressed and submitted to the user. Depending on several experiments, the query processor part of the system showed the ability to answer different kinds of queries ranging from simple exact match queries to complex ones. Furthermore, this paper introduced the idea of retrieving information from more than one compressed XML documents.

UR - http://www.scopus.com/inward/record.url?scp=84944055302&partnerID=8YFLogxK

U2 - 10.4018/978-1-4666-3898-3.ch007

DO - 10.4018/978-1-4666-3898-3.ch007

M3 - Chapter

SN - 1466638982

SN - 9781466638983

SP - 95

EP - 115

BT - Information Retrieval Methods for Multidisciplinary Applications

PB - IGI Global

ER -

Al-Hamadani B, Lu Z, Alwan RF. Schema independent XML compressor. In Information Retrieval Methods for Multidisciplinary Applications. IGI Global. 2013. p. 95-115 https://doi.org/10.4018/978-1-4666-3898-3.ch007