Integration of biological data and quality-driven source negotiation

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Evaluation of data non-quality in database or data warehouse systems is a preliminary stage before any data usage and analysis, moreover in the context of data integration where several sources provide more or less redundant or contradictory information items and whose quality is often unknown, imprecise and very heterogeneous. Our application domain is bioinformatics where more than five hundred of semi-structured databanks propose biological information without any quality information (i.e. metadata and statistics describing the production and the management oft he biological data). In order to facilitate the multi-source data integration in the context of distributed biological databanks, we propose a technique based on the concepts of quality contract and data source negotiation for a standard wrapper-mediator architecture. A quality source contract allows to specify quality dimensions necessary to the mediator for data extraction among several distributed resources. The source selection is dynamically computed with the contract negotiation which we propose to include into the mediation and the global query processings before data acquisition. The integration of the multi-source biological data is differed for the restitution and combination of the results of the global user’s query by techniques of data recommendation taking into account source quality requirements.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer Verlag
Pages256-269
Number of pages14
Volume2224
ISBN (Print)3540428666, 9783540428664
Publication statusPublished - 2001
Externally publishedYes
Event20th International Conference on Conceptual Modeling, ER 2001 - Yokohama, Japan
Duration: 27 Nov 200130 Nov 2001

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2224
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other20th International Conference on Conceptual Modeling, ER 2001
CountryJapan
CityYokohama
Period27/11/0130/11/01

Fingerprint

Data integration
Data warehouses
Query processing
Bioinformatics
Metadata
Mediator
Data Integration
Data acquisition
Statistics
Information Quality
Wrapper
Mediation
Data Warehouse
Query Processing
Data Acquisition
Recommendations
Query
Unknown
Resources
Necessary

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Berti-Equille, L. (2001). Integration of biological data and quality-driven source negotiation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2224, pp. 256-269). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2224). Springer Verlag.

Integration of biological data and quality-driven source negotiation. / Berti-Equille, Laure.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 2224 Springer Verlag, 2001. p. 256-269 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2224).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Berti-Equille, L 2001, Integration of biological data and quality-driven source negotiation. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 2224, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 2224, Springer Verlag, pp. 256-269, 20th International Conference on Conceptual Modeling, ER 2001, Yokohama, Japan, 27/11/01.
Berti-Equille L. Integration of biological data and quality-driven source negotiation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 2224. Springer Verlag. 2001. p. 256-269. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Berti-Equille, Laure. / Integration of biological data and quality-driven source negotiation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 2224 Springer Verlag, 2001. pp. 256-269 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{08f20a5693304ebeabc1b8fc7ab1dbbc,
title = "Integration of biological data and quality-driven source negotiation",
abstract = "Evaluation of data non-quality in database or data warehouse systems is a preliminary stage before any data usage and analysis, moreover in the context of data integration where several sources provide more or less redundant or contradictory information items and whose quality is often unknown, imprecise and very heterogeneous. Our application domain is bioinformatics where more than five hundred of semi-structured databanks propose biological information without any quality information (i.e. metadata and statistics describing the production and the management oft he biological data). In order to facilitate the multi-source data integration in the context of distributed biological databanks, we propose a technique based on the concepts of quality contract and data source negotiation for a standard wrapper-mediator architecture. A quality source contract allows to specify quality dimensions necessary to the mediator for data extraction among several distributed resources. The source selection is dynamically computed with the contract negotiation which we propose to include into the mediation and the global query processings before data acquisition. The integration of the multi-source biological data is differed for the restitution and combination of the results of the global user’s query by techniques of data recommendation taking into account source quality requirements.",
author = "Laure Berti-Equille",
year = "2001",
language = "English",
isbn = "3540428666",
volume = "2224",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "256--269",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Integration of biological data and quality-driven source negotiation

AU - Berti-Equille, Laure

PY - 2001

Y1 - 2001

N2 - Evaluation of data non-quality in database or data warehouse systems is a preliminary stage before any data usage and analysis, moreover in the context of data integration where several sources provide more or less redundant or contradictory information items and whose quality is often unknown, imprecise and very heterogeneous. Our application domain is bioinformatics where more than five hundred of semi-structured databanks propose biological information without any quality information (i.e. metadata and statistics describing the production and the management oft he biological data). In order to facilitate the multi-source data integration in the context of distributed biological databanks, we propose a technique based on the concepts of quality contract and data source negotiation for a standard wrapper-mediator architecture. A quality source contract allows to specify quality dimensions necessary to the mediator for data extraction among several distributed resources. The source selection is dynamically computed with the contract negotiation which we propose to include into the mediation and the global query processings before data acquisition. The integration of the multi-source biological data is differed for the restitution and combination of the results of the global user’s query by techniques of data recommendation taking into account source quality requirements.

AB - Evaluation of data non-quality in database or data warehouse systems is a preliminary stage before any data usage and analysis, moreover in the context of data integration where several sources provide more or less redundant or contradictory information items and whose quality is often unknown, imprecise and very heterogeneous. Our application domain is bioinformatics where more than five hundred of semi-structured databanks propose biological information without any quality information (i.e. metadata and statistics describing the production and the management oft he biological data). In order to facilitate the multi-source data integration in the context of distributed biological databanks, we propose a technique based on the concepts of quality contract and data source negotiation for a standard wrapper-mediator architecture. A quality source contract allows to specify quality dimensions necessary to the mediator for data extraction among several distributed resources. The source selection is dynamically computed with the contract negotiation which we propose to include into the mediation and the global query processings before data acquisition. The integration of the multi-source biological data is differed for the restitution and combination of the results of the global user’s query by techniques of data recommendation taking into account source quality requirements.

UR - http://www.scopus.com/inward/record.url?scp=79959661111&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79959661111&partnerID=8YFLogxK

M3 - Conference contribution

SN - 3540428666

SN - 9783540428664

VL - 2224

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 256

EP - 269

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

PB - Springer Verlag

ER -