Quality-aware integration and warehousing of genomic data

Laure Berti-Equille, Fouzia Moussouni

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Citations (Scopus)

Abstract

In human health and life sciences, researchers extensively collaborate with each other, sharing biomedical and genomic data and their experimental results. This necessitates dynamically integrating different databases or warehousing them into a single repository. Based on our past experience of building a data warehouse called GEDAW (Gene Expression Data Warehouse) that stores data on genes expressed in the liver during iron overload and liver pathologies, and also relevant information from public databanks (mostly in XML format), DNA chips home experiments and medical records, we present the lessons learned, the data quality issues in this context and the current solutions we propose for integrating and warehousing biomedical data. This paper provides a functional and modular architecture for data quality enhancement and awareness in the complex processes of integration and warehousing of biomedical data.

Original languageEnglish
Title of host publicationProceedings of the 2005 International Conference on Information Quality, ICIQ 2005
Publication statusPublished - 1 Dec 2005
Externally publishedYes
Event10th International Conference on Information Quality, ICIQ 2005 - Cambridge, MA, United States
Duration: 4 Nov 20056 Nov 2005

Other

Other10th International Conference on Information Quality, ICIQ 2005
CountryUnited States
CityCambridge, MA
Period4/11/056/11/05

Fingerprint

Data warehouses
Liver
Pathology
Gene expression
XML
DNA
Genes
Health
Iron
Experiments

Keywords

  • Biological and Genomic Data
  • Data Integration
  • Data Quality
  • Data Warehouse Quality

ASJC Scopus subject areas

  • Information Systems
  • Safety, Risk, Reliability and Quality

Cite this

Berti-Equille, L., & Moussouni, F. (2005). Quality-aware integration and warehousing of genomic data. In Proceedings of the 2005 International Conference on Information Quality, ICIQ 2005

Quality-aware integration and warehousing of genomic data. / Berti-Equille, Laure; Moussouni, Fouzia.

Proceedings of the 2005 International Conference on Information Quality, ICIQ 2005. 2005.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Berti-Equille, L & Moussouni, F 2005, Quality-aware integration and warehousing of genomic data. in Proceedings of the 2005 International Conference on Information Quality, ICIQ 2005. 10th International Conference on Information Quality, ICIQ 2005, Cambridge, MA, United States, 4/11/05.
Berti-Equille L, Moussouni F. Quality-aware integration and warehousing of genomic data. In Proceedings of the 2005 International Conference on Information Quality, ICIQ 2005. 2005
Berti-Equille, Laure ; Moussouni, Fouzia. / Quality-aware integration and warehousing of genomic data. Proceedings of the 2005 International Conference on Information Quality, ICIQ 2005. 2005.
@inproceedings{b2aaa79849a84b7c8d214610ec1f7e09,
title = "Quality-aware integration and warehousing of genomic data",
abstract = "In human health and life sciences, researchers extensively collaborate with each other, sharing biomedical and genomic data and their experimental results. This necessitates dynamically integrating different databases or warehousing them into a single repository. Based on our past experience of building a data warehouse called GEDAW (Gene Expression Data Warehouse) that stores data on genes expressed in the liver during iron overload and liver pathologies, and also relevant information from public databanks (mostly in XML format), DNA chips home experiments and medical records, we present the lessons learned, the data quality issues in this context and the current solutions we propose for integrating and warehousing biomedical data. This paper provides a functional and modular architecture for data quality enhancement and awareness in the complex processes of integration and warehousing of biomedical data.",
keywords = "Biological and Genomic Data, Data Integration, Data Quality, Data Warehouse Quality",
author = "Laure Berti-Equille and Fouzia Moussouni",
year = "2005",
month = "12",
day = "1",
language = "English",
booktitle = "Proceedings of the 2005 International Conference on Information Quality, ICIQ 2005",

}

TY - GEN

T1 - Quality-aware integration and warehousing of genomic data

AU - Berti-Equille, Laure

AU - Moussouni, Fouzia

PY - 2005/12/1

Y1 - 2005/12/1

N2 - In human health and life sciences, researchers extensively collaborate with each other, sharing biomedical and genomic data and their experimental results. This necessitates dynamically integrating different databases or warehousing them into a single repository. Based on our past experience of building a data warehouse called GEDAW (Gene Expression Data Warehouse) that stores data on genes expressed in the liver during iron overload and liver pathologies, and also relevant information from public databanks (mostly in XML format), DNA chips home experiments and medical records, we present the lessons learned, the data quality issues in this context and the current solutions we propose for integrating and warehousing biomedical data. This paper provides a functional and modular architecture for data quality enhancement and awareness in the complex processes of integration and warehousing of biomedical data.

AB - In human health and life sciences, researchers extensively collaborate with each other, sharing biomedical and genomic data and their experimental results. This necessitates dynamically integrating different databases or warehousing them into a single repository. Based on our past experience of building a data warehouse called GEDAW (Gene Expression Data Warehouse) that stores data on genes expressed in the liver during iron overload and liver pathologies, and also relevant information from public databanks (mostly in XML format), DNA chips home experiments and medical records, we present the lessons learned, the data quality issues in this context and the current solutions we propose for integrating and warehousing biomedical data. This paper provides a functional and modular architecture for data quality enhancement and awareness in the complex processes of integration and warehousing of biomedical data.

KW - Biological and Genomic Data

KW - Data Integration

KW - Data Quality

KW - Data Warehouse Quality

UR - http://www.scopus.com/inward/record.url?scp=84871554878&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84871554878&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84871554878

BT - Proceedings of the 2005 International Conference on Information Quality, ICIQ 2005

ER -