MultiView

Multilevel video content representation and retrieval

J. Fan, W. G. Aref, Ahmed Elmagarmid, M. S. Hacid, M. S. Marzouk, X. Zhu

Research output: Contribution to journalArticle

43 Citations (Scopus)

Abstract

In this article, several practical algorithms are proposed to support content-based video analysis, modeling, representation, summarization, indexing, and access. First, a multilevel video database model is given. One advantage of this model is that it provides a reasonable approach to bridging the gap between low-level representative features and high-level semantic concepts from a human point of view. Second, several model-based video analysis techniques are proposed. In order to detect the video shots, we present a novel technique, which can adapt the threshold for scene cut detection to the activities of variant videos or even different video shots. A seeded region aggregation and temporal tracking technique is proposed for generating the semantic video objects. The semantic video scenes can then be generated from these extracted video access units (e.g., shots and objects) according to some domain knowledge. Third, in order to categorize video contents into a set of semantic clusters, an integrated video classification technique is developed to support more efficient multilevel video representation, summarization, indexing, and access techniques.

Original languageEnglish
Pages (from-to)895-908
Number of pages14
JournalJournal of Electronic Imaging
Volume10
Issue number4
DOIs
Publication statusPublished - 1 Oct 2001
Externally publishedYes

Fingerprint

semantics
retrieval
Semantics
shot
Agglomeration
thresholds

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Atomic and Molecular Physics, and Optics
  • Computer Vision and Pattern Recognition

Cite this

MultiView : Multilevel video content representation and retrieval. / Fan, J.; Aref, W. G.; Elmagarmid, Ahmed; Hacid, M. S.; Marzouk, M. S.; Zhu, X.

In: Journal of Electronic Imaging, Vol. 10, No. 4, 01.10.2001, p. 895-908.

Research output: Contribution to journalArticle

Fan, J. ; Aref, W. G. ; Elmagarmid, Ahmed ; Hacid, M. S. ; Marzouk, M. S. ; Zhu, X. / MultiView : Multilevel video content representation and retrieval. In: Journal of Electronic Imaging. 2001 ; Vol. 10, No. 4. pp. 895-908.
@article{ad500d2e5b0e408481456a7696b1d2fc,
title = "MultiView: Multilevel video content representation and retrieval",
abstract = "In this article, several practical algorithms are proposed to support content-based video analysis, modeling, representation, summarization, indexing, and access. First, a multilevel video database model is given. One advantage of this model is that it provides a reasonable approach to bridging the gap between low-level representative features and high-level semantic concepts from a human point of view. Second, several model-based video analysis techniques are proposed. In order to detect the video shots, we present a novel technique, which can adapt the threshold for scene cut detection to the activities of variant videos or even different video shots. A seeded region aggregation and temporal tracking technique is proposed for generating the semantic video objects. The semantic video scenes can then be generated from these extracted video access units (e.g., shots and objects) according to some domain knowledge. Third, in order to categorize video contents into a set of semantic clusters, an integrated video classification technique is developed to support more efficient multilevel video representation, summarization, indexing, and access techniques.",
author = "J. Fan and Aref, {W. G.} and Ahmed Elmagarmid and Hacid, {M. S.} and Marzouk, {M. S.} and X. Zhu",
year = "2001",
month = "10",
day = "1",
doi = "10.1117/1.1406944",
language = "English",
volume = "10",
pages = "895--908",
journal = "Journal of Electronic Imaging",
issn = "1017-9909",
publisher = "SPIE",
number = "4",

}

TY - JOUR

T1 - MultiView

T2 - Multilevel video content representation and retrieval

AU - Fan, J.

AU - Aref, W. G.

AU - Elmagarmid, Ahmed

AU - Hacid, M. S.

AU - Marzouk, M. S.

AU - Zhu, X.

PY - 2001/10/1

Y1 - 2001/10/1

N2 - In this article, several practical algorithms are proposed to support content-based video analysis, modeling, representation, summarization, indexing, and access. First, a multilevel video database model is given. One advantage of this model is that it provides a reasonable approach to bridging the gap between low-level representative features and high-level semantic concepts from a human point of view. Second, several model-based video analysis techniques are proposed. In order to detect the video shots, we present a novel technique, which can adapt the threshold for scene cut detection to the activities of variant videos or even different video shots. A seeded region aggregation and temporal tracking technique is proposed for generating the semantic video objects. The semantic video scenes can then be generated from these extracted video access units (e.g., shots and objects) according to some domain knowledge. Third, in order to categorize video contents into a set of semantic clusters, an integrated video classification technique is developed to support more efficient multilevel video representation, summarization, indexing, and access techniques.

AB - In this article, several practical algorithms are proposed to support content-based video analysis, modeling, representation, summarization, indexing, and access. First, a multilevel video database model is given. One advantage of this model is that it provides a reasonable approach to bridging the gap between low-level representative features and high-level semantic concepts from a human point of view. Second, several model-based video analysis techniques are proposed. In order to detect the video shots, we present a novel technique, which can adapt the threshold for scene cut detection to the activities of variant videos or even different video shots. A seeded region aggregation and temporal tracking technique is proposed for generating the semantic video objects. The semantic video scenes can then be generated from these extracted video access units (e.g., shots and objects) according to some domain knowledge. Third, in order to categorize video contents into a set of semantic clusters, an integrated video classification technique is developed to support more efficient multilevel video representation, summarization, indexing, and access techniques.

UR - http://www.scopus.com/inward/record.url?scp=0035492784&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0035492784&partnerID=8YFLogxK

U2 - 10.1117/1.1406944

DO - 10.1117/1.1406944

M3 - Article

VL - 10

SP - 895

EP - 908

JO - Journal of Electronic Imaging

JF - Journal of Electronic Imaging

SN - 1017-9909

IS - 4

ER -