Abstract
XML filtering applications are gaining increasing popularity recently. Automata are generally adopted to construct query indexes for evaluating large numbers of XPath queries over XML streams. Usually only shallow data are observed in existing approaches. How to process deep and recursive XML data with low memory limitation efficiently is still a challenging issue. In this paper, we propose HFilter, a Hybrid Finite Automaton (HFA) based stream filtering approach, to solve this problem. We introduce the basic two-tier HFA (lazy DFA tier and NFA tier) first, which realizes data prefix sharing and memory overflow control to improve the filtering throughput. Then an optimized three-tier HFA with an extra pre-expanded DFA tier is put forward, which significantly reduces the restarting cost of HFA after memory overflow. Experiments show that our approaches work more efficiently than existing ones.
Original language | English |
---|---|
Title of host publication | Database and Expert Systems Applications |
Subtitle of host publication | 19th International Conference, DEXA 2008, Turin, Italy, September 1-5, 2008. Proceedings |
Publisher | Springer-Verlag Berlin Heidelberg |
Pages | 566-580 |
Number of pages | 15 |
Volume | LNCS 5181 |
ISBN (Electronic) | 9783540856542 |
ISBN (Print) | 3540856536, 9783540856535 |
DOIs | |
Publication status | Published - 2008 |
Externally published | Yes |
Event | 19th International Conference on Database and Expert Systems Applications - Turin, Italy Duration: 1 Sep 2008 → 5 Sep 2008 Conference number: 19 http://www.dexa.org/previous/dexa2008/index.html (Link to Conference Website) |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Publisher | Springer-Verlag Berlin Heidelberg |
Volume | 5181 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 19th International Conference on Database and Expert Systems Applications |
---|---|
Abbreviated title | DEXA 2008 |
Country/Territory | Italy |
City | Turin |
Period | 1/09/08 → 5/09/08 |
Internet address |
|