Quality-aware integration and warehousing of genomic data

Laure Berti-Equille, Fouzia Moussouni

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Abstract

In human health and life sciences, researchers extensively collaborate with each other, sharing biomedical and genomic data and their experimental results. This necessitates dynamically integrating different databases or warehousing them into a single repository. Based on our past experience of building a data warehouse called GEDAW (Gene Expression Data Warehouse) that stores data on genes expressed in the liver during iron overload and liver pathologies, and also relevant information from public databanks (mostly in XML format), DNA chips home experiments and medical records, we present the lessons learned, the data quality issues in this context and the current solutions we propose for integrating and warehousing biomedical data. This paper provides a functional and modular architecture for data quality enhancement and awareness in the complex processes of integration and warehousing of biomedical data.

Original languageEnglish
Title of host publicationProceedings of the 2005 International Conference on Information Quality, ICIQ 2005
Publication statusPublished - 1 Dec 2005
Externally publishedYes
Event10th International Conference on Information Quality, ICIQ 2005 - Cambridge, MA, United States
Duration: 4 Nov 20056 Nov 2005

Other

Other10th International Conference on Information Quality, ICIQ 2005
CountryUnited States
CityCambridge, MA
Period4/11/056/11/05

    Fingerprint

Keywords

  • Biological and Genomic Data
  • Data Integration
  • Data Quality
  • Data Warehouse Quality

ASJC Scopus subject areas

  • Information Systems
  • Safety, Risk, Reliability and Quality

Cite this

Berti-Equille, L., & Moussouni, F. (2005). Quality-aware integration and warehousing of genomic data. In Proceedings of the 2005 International Conference on Information Quality, ICIQ 2005