Schema-as-you-go: On probabilistic tagging and querying of wide tables

Meiyu Lu, Divyakant Agrawal, Bing Tian Dai, Anthony K.H. Tung

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

4 Citations (Scopus)

Abstract

The emergence of Web 2.0 has resulted in a huge amount of heterogeneous data that are contributed by a large number of users, engendering new challenges for data management and query processing. Given that the data are unified from various sources and accessed by numerous users, providing users with a unified mediated schema as data integration is insufficient. On one hand, a deterministic mediated schema restricts users' freedom to express queries in their preferred vocabulary; on the other hand, it is not realistic for users to remember the numerous attribute names that arise from integrating various data sources. As such, a user-oriented data management and query interface is required. In this paper, we propose an out-of-the-box approach that separates users' actions from database operations. This separating layer deals with the challenges from a semantic perspective. It interprets the semantics of each data value through tags that are provided by users, and then inserts the value into the database together with these tags. When querying the database, this layer also serves as a platform for retrieving data by interpreting the semantics of the queried tags from the users. Experiments are conducted to illustrate both the effectiveness and efficiency of our approach.
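
The abstract describes the separating layer only at a high level, so the following is a minimal sketch of the general idea rather than the paper's method. It assumes a toy in-memory store, and it swaps in a plain Jaccard overlap between the value sets seen under two tags as a stand-in for the semantic interpretation of tags, which the abstract does not specify. The names `TagLayer`, `insert`, `tag_similarity`, `query`, and the `threshold` parameter are all hypothetical.

```python
from collections import defaultdict


class TagLayer:
    """Toy layer between users and storage (hypothetical, not the paper's design)."""

    def __init__(self):
        self.cells = []                        # (row_id, tag, value) triples
        self.values_by_tag = defaultdict(set)  # tag -> distinct values seen under it

    def insert(self, row_id, tagged_values):
        """Store each value together with the tag the user supplied for it."""
        for tag, value in tagged_values.items():
            self.cells.append((row_id, tag, value))
            self.values_by_tag[tag].add(value)

    def tag_similarity(self, a, b):
        """Jaccard overlap of the value sets of two tags (an assumed stand-in score)."""
        va = self.values_by_tag.get(a, set())
        vb = self.values_by_tag.get(b, set())
        union = va | vb
        return len(va & vb) / len(union) if union else 0.0

    def query(self, tag, threshold=0.3):
        """Return (row_id, value, score) for cells whose tag resembles the queried tag."""
        hits = []
        for row_id, cell_tag, value in self.cells:
            score = self.tag_similarity(tag, cell_tag)
            if score >= threshold:
                hits.append((row_id, value, score))
        # Highest-scoring interpretations first; ties keep insertion order.
        return sorted(hits, key=lambda h: -h[2])


layer = TagLayer()
layer.insert(1, {"author": "Tolkien", "title": "The Hobbit"})
layer.insert(2, {"writer": "Tolkien", "name": "The Silmarillion"})
layer.insert(3, {"writer": "Le Guin"})

for hit in layer.query("author"):
    print(hit)
```

In this toy data, a query for "author" also retrieves values stored under "writer", because the two tags share the value "Tolkien" and therefore score 0.5 against each other, while "author" scores 1.0 against itself. The score is a similarity, not a calibrated probability; a real system would need a principled model for that step.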

Original language: English
Title of host publication: Proceedings of SIGMOD 2011 and PODS 2011
Pages: 181-192
Number of pages: 12
DOI: https://doi.org/10.1145/1989323.1989343
Publication status: Published - 11 Jul 2011
Event: 2011 ACM SIGMOD and 30th PODS 2011 Conference - Athens, Greece
Duration: 12 Jun 2011 → 16 Jun 2011

Publication series

Name: Proceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print): 0730-8078

Other

Other: 2011 ACM SIGMOD and 30th PODS 2011 Conference
Country: Greece
City: Athens
Period: 12/6/11 → 16/6/11

Keywords

  • dynamic instantiation
  • probabilistic tagging
  • wide table

ASJC Scopus subject areas

  • Software
  • Information Systems

Cite this

Lu, M., Agrawal, D., Dai, B. T., & Tung, A. K. H. (2011). Schema-as-you-go: On probabilistic tagging and querying of wide tables. In Proceedings of SIGMOD 2011 and PODS 2011 (pp. 181-192). (Proceedings of the ACM SIGMOD International Conference on Management of Data). https://doi.org/10.1145/1989323.1989343