A model of multimedia information retrieval

Carlo Meghini, Fabrizio Sebastiani, Umberto Straccia

Research output: Contribution to journalArticle

98 Citations (Scopus)

Abstract

Research on multimedia information retrieval (MIR) has recently witnessed a booming interest. A prominent feature of this research trend is its simultaneous but independent materialization within several fields of computer science. The resulting richness of paradigms, methods and systems may, on the long run, result in a fragmentation of efforts and slow down progress. The primary goal of this study is to promote an integration of methods and techniques for MIR by contributing a conceptual model that encompasses in a unified and coherent perspective the many efforts that are being produced under the label of MIR. The model offers a retrieval capability that spans two media, text and images, but also several dimensions: form, content and structure. In this way, it reconciles similarity-based methods with semantics-based ones, providing the guidelines for the design of systems that are able to provide a generalized multimedia retrieval service, in which the existing forms of retrieval not only coexist, but can be combined in any desired manner. The model is formulated in terms of a fuzzy description logic, which plays a twofold role: (1) it directly models semantics-based retrieval, and (2) it offers an ideal framework for the integration of the multimedia and multidimensional aspects of retrieval mentioned above. The model also accounts for relevance feedback in both text and image retrieval, integrating known techniques for taking into account user judgments. The implementation of the model is addressed by presenting a decomposition technique that reduces query evaluation to the processing of simpler requests, each of which can be solved by means of widely known methods for text and image retrieval, and semantic processing. A prototype for multidimensional image retrieval is presented that shows this decomposition technique at work in a significant case.

Original languageEnglish
Pages (from-to)909-970
Number of pages62
JournalJournal of the ACM
Volume48
Issue number5
DOIs
Publication statusPublished - Sep 2001
Externally publishedYes

Fingerprint

Information retrieval
Information Retrieval
Multimedia
Retrieval
Image Retrieval
Image retrieval
Text Retrieval
Decomposition Techniques
Semantics
Query Evaluation
Relevance Feedback
Model
Decomposition
Description Logics
Conceptual Model
Fragmentation
Long-run
Fuzzy Logic
Processing
Computer Science

Keywords

  • Description logics
  • Fuzzy logics
  • Multimedia information retrieval

ASJC Scopus subject areas

  • Hardware and Architecture
  • Information Systems
  • Computer Graphics and Computer-Aided Design
  • Software
  • Theoretical Computer Science
  • Computational Theory and Mathematics

Cite this

Meghini, C., Sebastiani, F., & Straccia, U. (2001). A model of multimedia information retrieval. Journal of the ACM, 48(5), 909-970. https://doi.org/10.1145/502102.502103

A model of multimedia information retrieval. / Meghini, Carlo; Sebastiani, Fabrizio; Straccia, Umberto.

In: Journal of the ACM, Vol. 48, No. 5, 09.2001, p. 909-970.

Research output: Contribution to journalArticle

Meghini, C, Sebastiani, F & Straccia, U 2001, 'A model of multimedia information retrieval', Journal of the ACM, vol. 48, no. 5, pp. 909-970. https://doi.org/10.1145/502102.502103
Meghini, Carlo ; Sebastiani, Fabrizio ; Straccia, Umberto. / A model of multimedia information retrieval. In: Journal of the ACM. 2001 ; Vol. 48, No. 5. pp. 909-970.
@article{45128a52b46940a78b512ad947148a8a,
title = "A model of multimedia information retrieval",
abstract = "Research on multimedia information retrieval (MIR) has recently witnessed a booming interest. A prominent feature of this research trend is its simultaneous but independent materialization within several fields of computer science. The resulting richness of paradigms, methods and systems may, on the long run, result in a fragmentation of efforts and slow down progress. The primary goal of this study is to promote an integration of methods and techniques for MIR by contributing a conceptual model that encompasses in a unified and coherent perspective the many efforts that are being produced under the label of MIR. The model offers a retrieval capability that spans two media, text and images, but also several dimensions: form, content and structure. In this way, it reconciles similarity-based methods with semantics-based ones, providing the guidelines for the design of systems that are able to provide a generalized multimedia retrieval service, in which the existing forms of retrieval not only coexist, but can be combined in any desired manner. The model is formulated in terms of a fuzzy description logic, which plays a twofold role: (1) it directly models semantics-based retrieval, and (2) it offers an ideal framework for the integration of the multimedia and multidimensional aspects of retrieval mentioned above. The model also accounts for relevance feedback in both text and image retrieval, integrating known techniques for taking into account user judgments. The implementation of the model is addressed by presenting a decomposition technique that reduces query evaluation to the processing of simpler requests, each of which can be solved by means of widely known methods for text and image retrieval, and semantic processing. A prototype for multidimensional image retrieval is presented that shows this decomposition technique at work in a significant case.",
keywords = "Description logics, Fuzzy logics, Multimedia information retrieval",
author = "Carlo Meghini and Fabrizio Sebastiani and Umberto Straccia",
year = "2001",
month = "9",
doi = "10.1145/502102.502103",
language = "English",
volume = "48",
pages = "909--970",
journal = "Journal of the ACM",
issn = "0004-5411",
publisher = "Association for Computing Machinery (ACM)",
number = "5",

}

TY - JOUR

T1 - A model of multimedia information retrieval

AU - Meghini, Carlo

AU - Sebastiani, Fabrizio

AU - Straccia, Umberto

PY - 2001/9

Y1 - 2001/9

N2 - Research on multimedia information retrieval (MIR) has recently witnessed a booming interest. A prominent feature of this research trend is its simultaneous but independent materialization within several fields of computer science. The resulting richness of paradigms, methods and systems may, on the long run, result in a fragmentation of efforts and slow down progress. The primary goal of this study is to promote an integration of methods and techniques for MIR by contributing a conceptual model that encompasses in a unified and coherent perspective the many efforts that are being produced under the label of MIR. The model offers a retrieval capability that spans two media, text and images, but also several dimensions: form, content and structure. In this way, it reconciles similarity-based methods with semantics-based ones, providing the guidelines for the design of systems that are able to provide a generalized multimedia retrieval service, in which the existing forms of retrieval not only coexist, but can be combined in any desired manner. The model is formulated in terms of a fuzzy description logic, which plays a twofold role: (1) it directly models semantics-based retrieval, and (2) it offers an ideal framework for the integration of the multimedia and multidimensional aspects of retrieval mentioned above. The model also accounts for relevance feedback in both text and image retrieval, integrating known techniques for taking into account user judgments. The implementation of the model is addressed by presenting a decomposition technique that reduces query evaluation to the processing of simpler requests, each of which can be solved by means of widely known methods for text and image retrieval, and semantic processing. A prototype for multidimensional image retrieval is presented that shows this decomposition technique at work in a significant case.

AB - Research on multimedia information retrieval (MIR) has recently witnessed a booming interest. A prominent feature of this research trend is its simultaneous but independent materialization within several fields of computer science. The resulting richness of paradigms, methods and systems may, on the long run, result in a fragmentation of efforts and slow down progress. The primary goal of this study is to promote an integration of methods and techniques for MIR by contributing a conceptual model that encompasses in a unified and coherent perspective the many efforts that are being produced under the label of MIR. The model offers a retrieval capability that spans two media, text and images, but also several dimensions: form, content and structure. In this way, it reconciles similarity-based methods with semantics-based ones, providing the guidelines for the design of systems that are able to provide a generalized multimedia retrieval service, in which the existing forms of retrieval not only coexist, but can be combined in any desired manner. The model is formulated in terms of a fuzzy description logic, which plays a twofold role: (1) it directly models semantics-based retrieval, and (2) it offers an ideal framework for the integration of the multimedia and multidimensional aspects of retrieval mentioned above. The model also accounts for relevance feedback in both text and image retrieval, integrating known techniques for taking into account user judgments. The implementation of the model is addressed by presenting a decomposition technique that reduces query evaluation to the processing of simpler requests, each of which can be solved by means of widely known methods for text and image retrieval, and semantic processing. A prototype for multidimensional image retrieval is presented that shows this decomposition technique at work in a significant case.

KW - Description logics

KW - Fuzzy logics

KW - Multimedia information retrieval

UR - http://www.scopus.com/inward/record.url?scp=0038167068&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0038167068&partnerID=8YFLogxK

U2 - 10.1145/502102.502103

DO - 10.1145/502102.502103

M3 - Article

VL - 48

SP - 909

EP - 970

JO - Journal of the ACM

JF - Journal of the ACM

SN - 0004-5411

IS - 5

ER -