Efficient integration and aggregation of historical information

Mirek Riedewald, Divyakant Agrawal, Amr El Abbadi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

12 Citations (Scopus)

Abstract

Data warehouses support the analysis of historical data. This often involves aggregation over a period of time. Furthermore, data is typically incorporated in the warehouse in the increasing order of a time attribute, e.g., date of a sale or time of a temperature measurement. In this paper we propose a framework to take advantage of this append-only nature of updates due to a time attribute. The framework allows us to integrate large amounts of new data into the warehouse and generate historical summaries efficiently. Query and update costs are virtually independent from the extent of the data set in the time dimension, making our framework an attractive aggregation approach for append-only data streams. A specific instantiation of the general approach is developed for MOLAP data cubes, involving a new data structure for append-only arrays with pre-aggregated values. Our framework is applicable to point data and data with extent, e.g., hyper-rectangles.

Original languageEnglish
Title of host publicationProceedings of the ACM SIGMOD International Conference on Management of Data
EditorsM.F.B. Moon, A. Ailamaki
Pages13-24
Number of pages12
Publication statusPublished - 2002
Externally publishedYes
EventACM SIGMOD 2002 Proceedings of the ACM SIGMOD International Conference on Managment of Data - Madison, WI, United States
Duration: 3 Jun 20026 Jun 2002

Other

OtherACM SIGMOD 2002 Proceedings of the ACM SIGMOD International Conference on Managment of Data
CountryUnited States
CityMadison, WI
Period3/6/026/6/02

Fingerprint

Warehouses
Agglomeration
Data warehouses
Temperature measurement
Data structures
Sales
Costs

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Riedewald, M., Agrawal, D., & El Abbadi, A. (2002). Efficient integration and aggregation of historical information. In M. F. B. Moon, & A. Ailamaki (Eds.), Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 13-24)

Efficient integration and aggregation of historical information. / Riedewald, Mirek; Agrawal, Divyakant; El Abbadi, Amr.

Proceedings of the ACM SIGMOD International Conference on Management of Data. ed. / M.F.B. Moon; A. Ailamaki. 2002. p. 13-24.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Riedewald, M, Agrawal, D & El Abbadi, A 2002, Efficient integration and aggregation of historical information. in MFB Moon & A Ailamaki (eds), Proceedings of the ACM SIGMOD International Conference on Management of Data. pp. 13-24, ACM SIGMOD 2002 Proceedings of the ACM SIGMOD International Conference on Managment of Data, Madison, WI, United States, 3/6/02.
Riedewald M, Agrawal D, El Abbadi A. Efficient integration and aggregation of historical information. In Moon MFB, Ailamaki A, editors, Proceedings of the ACM SIGMOD International Conference on Management of Data. 2002. p. 13-24
Riedewald, Mirek ; Agrawal, Divyakant ; El Abbadi, Amr. / Efficient integration and aggregation of historical information. Proceedings of the ACM SIGMOD International Conference on Management of Data. editor / M.F.B. Moon ; A. Ailamaki. 2002. pp. 13-24
@inproceedings{1b79caede9554b68b669fe2c6b9fd570,
title = "Efficient integration and aggregation of historical information",
abstract = "Data warehouses support the analysis of historical data. This often involves aggregation over a period of time. Furthermore, data is typically incorporated in the warehouse in the increasing order of a time attribute, e.g., date of a sale or time of a temperature measurement. In this paper we propose a framework to take advantage of this append-only nature of updates due to a time attribute. The framework allows us to integrate large amounts of new data into the warehouse and generate historical summaries efficiently. Query and update costs are virtually independent from the extent of the data set in the time dimension, making our framework an attractive aggregation approach for append-only data streams. A specific instantiation of the general approach is developed for MOLAP data cubes, involving a new data structure for append-only arrays with pre-aggregated values. Our framework is applicable to point data and data with extent, e.g., hyper-rectangles.",
author = "Mirek Riedewald and Divyakant Agrawal and {El Abbadi}, Amr",
year = "2002",
language = "English",
pages = "13--24",
editor = "M.F.B. Moon and A. Ailamaki",
booktitle = "Proceedings of the ACM SIGMOD International Conference on Management of Data",

}

TY - GEN

T1 - Efficient integration and aggregation of historical information

AU - Riedewald, Mirek

AU - Agrawal, Divyakant

AU - El Abbadi, Amr

PY - 2002

Y1 - 2002

N2 - Data warehouses support the analysis of historical data. This often involves aggregation over a period of time. Furthermore, data is typically incorporated in the warehouse in the increasing order of a time attribute, e.g., date of a sale or time of a temperature measurement. In this paper we propose a framework to take advantage of this append-only nature of updates due to a time attribute. The framework allows us to integrate large amounts of new data into the warehouse and generate historical summaries efficiently. Query and update costs are virtually independent from the extent of the data set in the time dimension, making our framework an attractive aggregation approach for append-only data streams. A specific instantiation of the general approach is developed for MOLAP data cubes, involving a new data structure for append-only arrays with pre-aggregated values. Our framework is applicable to point data and data with extent, e.g., hyper-rectangles.

AB - Data warehouses support the analysis of historical data. This often involves aggregation over a period of time. Furthermore, data is typically incorporated in the warehouse in the increasing order of a time attribute, e.g., date of a sale or time of a temperature measurement. In this paper we propose a framework to take advantage of this append-only nature of updates due to a time attribute. The framework allows us to integrate large amounts of new data into the warehouse and generate historical summaries efficiently. Query and update costs are virtually independent from the extent of the data set in the time dimension, making our framework an attractive aggregation approach for append-only data streams. A specific instantiation of the general approach is developed for MOLAP data cubes, involving a new data structure for append-only arrays with pre-aggregated values. Our framework is applicable to point data and data with extent, e.g., hyper-rectangles.

UR - http://www.scopus.com/inward/record.url?scp=0036372523&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0036372523&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0036372523

SP - 13

EP - 24

BT - Proceedings of the ACM SIGMOD International Conference on Management of Data

A2 - Moon, M.F.B.

A2 - Ailamaki, A.

ER -