Efficient distribution of full-fledged XQuery

Ying Zhang, Nan Tang, Peter Boncz

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Abstract

We investigate techniques to automatically decompose any XQuery query into subqueries, that can be executed near their data sources; i.e., function-shipping. In this scenario, the subqueries being executed remotely may have XML nodevalued parameters or results, that must be shipped in some way. The main challenge addressed here is to ensure that the decomposed queries properly respect XML node identity and preserve structural properties, when (parts of) XML nodes are sent over the network, effectively copying them. We start by precisely characterizing the conditions, under which pass-by-value parameter passing causes semantic differences between remote execution of an XQuery expression and its local execution. We then formulate a conservative strategy that effectively avoids decomposition in such cases. To broaden the possibilities of query distribution, we extend the pass-byvalue semantics to a pass-by-fragment semantics, which keeps better track of node identities and structural properties. The pass-by-fragment semantics is subsequently refined to a pass-byprojection semantics by means of a novel runtime XML projection technique, which safely eliminates most semantic differences between the local and remote execution of an XQuery expression, and strongly reduces message sizes. The proposed techniques are implemented in XRPC, a simple yet efficient XQuery extension that enables function-shipping by adding a Remote Procedure Call mechanism to XQuery. Experiments on MonetDB/XQuery establish the performance potential of our XQuery decomposition techniques.

Original languageEnglish
Title of host publicationProceedings - International Conference on Data Engineering
Pages565-576
Number of pages12
DOIs
Publication statusPublished - 8 Jul 2009
Externally publishedYes
Event25th IEEE International Conference on Data Engineering, ICDE 2009 - Shanghai, China
Duration: 29 Mar 20092 Apr 2009

Other

Other25th IEEE International Conference on Data Engineering, ICDE 2009
CountryChina
CityShanghai
Period29/3/092/4/09

Fingerprint

Semantics
XML
Freight transportation
Structural properties
Decomposition
Copying
Experiments

ASJC Scopus subject areas

  • Information Systems
  • Signal Processing
  • Software

Cite this

Zhang, Y., Tang, N., & Boncz, P. (2009). Efficient distribution of full-fledged XQuery. In Proceedings - International Conference on Data Engineering (pp. 565-576). [4812435] https://doi.org/10.1109/ICDE.2009.79

Efficient distribution of full-fledged XQuery. / Zhang, Ying; Tang, Nan; Boncz, Peter.

Proceedings - International Conference on Data Engineering. 2009. p. 565-576 4812435.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Zhang, Y, Tang, N & Boncz, P 2009, Efficient distribution of full-fledged XQuery. in Proceedings - International Conference on Data Engineering., 4812435, pp. 565-576, 25th IEEE International Conference on Data Engineering, ICDE 2009, Shanghai, China, 29/3/09. https://doi.org/10.1109/ICDE.2009.79
Zhang Y, Tang N, Boncz P. Efficient distribution of full-fledged XQuery. In Proceedings - International Conference on Data Engineering. 2009. p. 565-576. 4812435 https://doi.org/10.1109/ICDE.2009.79
Zhang, Ying ; Tang, Nan ; Boncz, Peter. / Efficient distribution of full-fledged XQuery. Proceedings - International Conference on Data Engineering. 2009. pp. 565-576
@inproceedings{aa755f73d4be4a78bc2e6f7956bbed06,
title = "Efficient distribution of full-fledged XQuery",
abstract = "We investigate techniques to automatically decompose any XQuery query into subqueries, that can be executed near their data sources; i.e., function-shipping. In this scenario, the subqueries being executed remotely may have XML nodevalued parameters or results, that must be shipped in some way. The main challenge addressed here is to ensure that the decomposed queries properly respect XML node identity and preserve structural properties, when (parts of) XML nodes are sent over the network, effectively copying them. We start by precisely characterizing the conditions, under which pass-by-value parameter passing causes semantic differences between remote execution of an XQuery expression and its local execution. We then formulate a conservative strategy that effectively avoids decomposition in such cases. To broaden the possibilities of query distribution, we extend the pass-byvalue semantics to a pass-by-fragment semantics, which keeps better track of node identities and structural properties. The pass-by-fragment semantics is subsequently refined to a pass-byprojection semantics by means of a novel runtime XML projection technique, which safely eliminates most semantic differences between the local and remote execution of an XQuery expression, and strongly reduces message sizes. The proposed techniques are implemented in XRPC, a simple yet efficient XQuery extension that enables function-shipping by adding a Remote Procedure Call mechanism to XQuery. Experiments on MonetDB/XQuery establish the performance potential of our XQuery decomposition techniques.",
author = "Ying Zhang and Nan Tang and Peter Boncz",
year = "2009",
month = "7",
day = "8",
doi = "10.1109/ICDE.2009.79",
language = "English",
isbn = "9780769535456",
pages = "565--576",
booktitle = "Proceedings - International Conference on Data Engineering",

}

TY - GEN

T1 - Efficient distribution of full-fledged XQuery

AU - Zhang, Ying

AU - Tang, Nan

AU - Boncz, Peter

PY - 2009/7/8

Y1 - 2009/7/8

N2 - We investigate techniques to automatically decompose any XQuery query into subqueries, that can be executed near their data sources; i.e., function-shipping. In this scenario, the subqueries being executed remotely may have XML nodevalued parameters or results, that must be shipped in some way. The main challenge addressed here is to ensure that the decomposed queries properly respect XML node identity and preserve structural properties, when (parts of) XML nodes are sent over the network, effectively copying them. We start by precisely characterizing the conditions, under which pass-by-value parameter passing causes semantic differences between remote execution of an XQuery expression and its local execution. We then formulate a conservative strategy that effectively avoids decomposition in such cases. To broaden the possibilities of query distribution, we extend the pass-byvalue semantics to a pass-by-fragment semantics, which keeps better track of node identities and structural properties. The pass-by-fragment semantics is subsequently refined to a pass-byprojection semantics by means of a novel runtime XML projection technique, which safely eliminates most semantic differences between the local and remote execution of an XQuery expression, and strongly reduces message sizes. The proposed techniques are implemented in XRPC, a simple yet efficient XQuery extension that enables function-shipping by adding a Remote Procedure Call mechanism to XQuery. Experiments on MonetDB/XQuery establish the performance potential of our XQuery decomposition techniques.

AB - We investigate techniques to automatically decompose any XQuery query into subqueries, that can be executed near their data sources; i.e., function-shipping. In this scenario, the subqueries being executed remotely may have XML nodevalued parameters or results, that must be shipped in some way. The main challenge addressed here is to ensure that the decomposed queries properly respect XML node identity and preserve structural properties, when (parts of) XML nodes are sent over the network, effectively copying them. We start by precisely characterizing the conditions, under which pass-by-value parameter passing causes semantic differences between remote execution of an XQuery expression and its local execution. We then formulate a conservative strategy that effectively avoids decomposition in such cases. To broaden the possibilities of query distribution, we extend the pass-byvalue semantics to a pass-by-fragment semantics, which keeps better track of node identities and structural properties. The pass-by-fragment semantics is subsequently refined to a pass-byprojection semantics by means of a novel runtime XML projection technique, which safely eliminates most semantic differences between the local and remote execution of an XQuery expression, and strongly reduces message sizes. The proposed techniques are implemented in XRPC, a simple yet efficient XQuery extension that enables function-shipping by adding a Remote Procedure Call mechanism to XQuery. Experiments on MonetDB/XQuery establish the performance potential of our XQuery decomposition techniques.

UR - http://www.scopus.com/inward/record.url?scp=67649651674&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=67649651674&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2009.79

DO - 10.1109/ICDE.2009.79

M3 - Conference contribution

SN - 9780769535456

SP - 565

EP - 576

BT - Proceedings - International Conference on Data Engineering

ER -