Modelling the retrieval of structured documents containing texts and images

Carlo Meghini, Fabrizio Sebastiani, Umberto Straccia

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

3 Citations (Scopus)

Abstract

We present a model for complex documents possibly consisting of a hierarchically structured set of images or texts. Documents are represented at the form level (as sets of physical features of the representing objects), at the content level (as sets of properties of the represented entities), and at the structure level. A uniform and powerful query language allows queries to be issued that transparently combine features pertaining to form, content and structure alike. Queries are expressions of a (fuzzy) logical language. While the part of the query that pertains to (medium-independent) content is processed "directly" by an inferential engine, the part that pertains to (medium-dependent) form is entrusted to specialised document processing procedures linked to the logical language by a procedural attachment mechanism. The model thus combines the power of state-of-the-art document processing techniques with the advantages of a clean, logically defined framework for understanding multimedia document retrieval.
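
The division of labour described in the abstract can be illustrated with a minimal sketch. It is not the authors' formalism: the `Document` class, the predicate names `AboutVenice` and `MostlyRed`, the `red_fraction` feature, and the use of `min` as fuzzy conjunction are all assumptions introduced here for illustration. Content atoms are looked up in a store of fuzzy assertions (standing in for the inferential engine), while form atoms are dispatched to attached procedures.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class Document:
    name: str
    # Content level (hypothetical): fuzzy assertions about the represented entities.
    content: Dict[str, float] = field(default_factory=dict)
    # Form level (hypothetical): physical features of the representing object.
    form: Dict[str, float] = field(default_factory=dict)


def mostly_red(doc: Document) -> float:
    """Attached procedure: turns a raw form feature into a degree in [0, 1]."""
    return max(0.0, min(1.0, doc.form.get("red_fraction", 0.0)))


# Procedural attachment table: predicate name -> document processing procedure.
ATTACHMENTS: Dict[str, Callable[[Document], float]] = {"MostlyRed": mostly_red}


def atom_degree(doc: Document, predicate: str) -> float:
    """Form predicates go to attached procedures; content predicates are looked up."""
    if predicate in ATTACHMENTS:
        return ATTACHMENTS[predicate](doc)
    return doc.content.get(predicate, 0.0)


def conjunction(doc: Document, predicates: List[str]) -> float:
    """Fuzzy 'and' taken as the minimum of the atom degrees (an assumed semantics)."""
    return min(atom_degree(doc, p) for p in predicates)


def rank(docs: List[Document], predicates: List[str]) -> List[Tuple[str, float]]:
    """Order documents by the degree to which they satisfy the conjunctive query."""
    scored = [(d.name, conjunction(d, predicates)) for d in docs]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


docs = [
    Document("a", {"AboutVenice": 0.9}, {"red_fraction": 0.4}),
    Document("b", {"AboutVenice": 0.6}, {"red_fraction": 0.8}),
]
print(rank(docs, ["AboutVenice", "MostlyRed"]))
```

In this sketch a single query mixes a content predicate and a form predicate without the caller needing to know which is which, which is the kind of transparency the abstract attributes to the query language.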

Original language: English
Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Publisher: Springer Verlag
Pages: 325-344
Number of pages: 20
Volume: 1324
ISBN (Print): 3540635548, 9783540635543
Publication status: Published - 1997
Externally published: Yes
Event: 1st European Conference on Research and Advanced Technology for Digital Libraries, ECDL 1997 - Pisa, Italy
Duration: 1 Sep 1997 to 3 Sep 1997

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 1324
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 1st European Conference on Research and Advanced Technology for Digital Libraries, ECDL 1997
Country: Italy
City: Pisa
Period: 1/9/97 to 3/9/97

Fingerprint

Retrieval
Query
Query languages
Processing
Modeling
Document Retrieval
Property of set
Alike
Query Language
Engines
Multimedia
Engine
Dependent
Model
Text
Form
Language
Object
Framework

ASJC Scopus subject areas

  • Computer Science (all)
  • Theoretical Computer Science

Cite this

Meghini, C., Sebastiani, F., & Straccia, U. (1997). Modelling the retrieval of structured documents containing texts and images. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1324, pp. 325-344). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 1324). Springer Verlag.

@inproceedings{50fdc7c9ea934136a34c56340dca50a4,
title = "Modelling the retrieval of structured documents containing texts and images",
abstract = "We present a model for complex documents possibly consisting of a hierarchically structured set of images or texts. Documents are represented at the form level (as sets of physical features of the representing objects), at the content level (as sets of properties of the represented entities), and at the structure level. A uniform and powerful query language allows queries to be issued that transparently combine features pertaining to form, content and structure alike. Queries are expressions of a (fuzzy) logical language. While the part of the query that pertains to (medium-independent) content is processed {"}directly{"} by an inferential engine, the part that pertains to (medium-dependent) form is entrusted to specialised document processing procedures linked to the logical language by a procedural attachment mechanism. The model thus combines the power of state-of-the-art document processing techniques with the advantages of a clean, logically defined framework for understanding multimedia document retrieval.",
author = "Carlo Meghini and Fabrizio Sebastiani and Umberto Straccia",
year = "1997",
language = "English",
isbn = "3540635548",
volume = "1324",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "325--344",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Modelling the retrieval of structured documents containing texts and images

AU - Meghini, Carlo

AU - Sebastiani, Fabrizio

AU - Straccia, Umberto

PY - 1997

Y1 - 1997

AB - We present a model for complex documents possibly consisting of a hierarchically structured set of images or texts. Documents are represented at the form level (as sets of physical features of the representing objects), at the content level (as sets of properties of the represented entities), and at the structure level. A uniform and powerful query language allows queries to be issued that transparently combine features pertaining to form, content and structure alike. Queries are expressions of a (fuzzy) logical language. While the part of the query that pertains to (medium-independent) content is processed "directly" by an inferential engine, the part that pertains to (medium-dependent) form is entrusted to specialised document processing procedures linked to the logical language by a procedural attachment mechanism. The model thus combines the power of state-of-the-art document processing techniques with the advantages of a clean, logically defined framework for understanding multimedia document retrieval.

UR - http://www.scopus.com/inward/record.url?scp=84957553631&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84957553631&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84957553631

SN - 3540635548

SN - 9783540635543

VL - 1324

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 325

EP - 344

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

PB - Springer Verlag

ER -