On the accuracy and complexity of rate-distortion models for fine-grained scalable video sequences

Cheng Hsin Hsu, Mohamed Hefeeda

Research output: Contribution to journal, Article

8 Citations (Scopus)

Abstract

Rate-distortion (R-D) models are functions that describe the relationship between the bitrate and expected level of distortion in the reconstructed video stream. R-D models enable optimization of the received video quality in different network conditions. Several R-D models have been proposed for the increasingly popular fine-grained scalable video sequences. However, the models' relative performance has not been thoroughly analyzed. Moreover, the time complexity of each model is not known, nor is the range of bitrates in which the model produces valid results. This lack of quantitative performance analysis makes it difficult to select the model that best suits a target streaming system. In this article, we classify, analyze, and rigorously evaluate all R-D models proposed for FGS coders in the literature. We classify R-D models into three categories: analytic, empirical, and semi-analytic. We describe the characteristics of each category. We analyze the R-D models by following their mathematical derivations, scrutinizing the assumptions made, and explaining when the assumptions fail and why. In addition, we implement all R-D models, a total of eight, and evaluate them using a diverse set of video sequences. In our evaluation, we consider various source characteristics, diverse channel conditions, different encoding/decoding parameters, different frame types, and several performance metrics including accuracy, range of applicability, and time complexity of each model. We also present clear systematic ways (pseudo codes) for constructing various R-D models from a given video sequence. Based on our experimental results, we present a justified list of recommendations on selecting the best R-D models for video-on-demand, video conferencing, real-time, and peer-to-peer streaming systems.

Original language: English
Article number: 15
Journal: ACM Transactions on Multimedia Computing, Communications and Applications
Volume: 4
Issue number: 2
DOIs: 10.1145/1352012.1352019
Publication status: Published - 1 May 2008
Externally published: Yes

Fingerprint

Video on demand
Video conferencing
Decoding

Keywords

  • Fine-grained scalable coding
  • Multimedia streaming
  • Rate-distortion models

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Hardware and Architecture

Cite this

On the accuracy and complexity of rate-distortion models for fine-grained scalable video sequences. / Hsu, Cheng Hsin; Hefeeda, Mohamed.

In: ACM Transactions on Multimedia Computing, Communications and Applications, Vol. 4, No. 2, 15, 01.05.2008.

@article{aba107bf4f774efda0c2648288ef443b,
title = "On the accuracy and complexity of rate-distortion models for fine-grained scalable video sequences",
abstract = "Rate-distortion (R-D) models are functions that describe the relationship between the bitrate and expected level of distortion in the reconstructed video stream. R-D models enable optimization of the received video quality in different network conditions. Several R-D models have been proposed for the increasingly popular fine-grained scalable video sequences. However, the models' relative performance has not been thoroughly analyzed. Moreover, the time complexity of each model is not known, nor is the range of bitrates in which the model produces valid results. This lack of quantitative performance analysis makes it difficult to select the model that best suits a target streaming system. In this article, we classify, analyze, and rigorously evaluate all R-D models proposed for FGS coders in the literature. We classify R-D models into three categories: analytic, empirical, and semi-analytic. We describe the characteristics of each category. We analyze the R-D models by following their mathematical derivations, scrutinizing the assumptions made, and explaining when the assumptions fail and why. In addition, we implement all R-D models, a total of eight, and evaluate them using a diverse set of video sequences. In our evaluation, we consider various source characteristics, diverse channel conditions, different encoding/decoding parameters, different frame types, and several performance metrics including accuracy, range of applicability, and time complexity of each model. We also present clear systematic ways (pseudo codes) for constructing various R-D models from a given video sequence. Based on our experimental results, we present a justified list of recommendations on selecting the best R-D models for video-on-demand, video conferencing, real-time, and peer-to-peer streaming systems.",
keywords = "Fine-grained scalable coding, Multimedia streaming, Rate-distortion models",
author = "Hsu, {Cheng Hsin} and Hefeeda, Mohamed",
year = "2008",
month = "5",
day = "1",
doi = "10.1145/1352012.1352019",
language = "English",
volume = "4",
journal = "ACM Transactions on Multimedia Computing, Communications and Applications",
issn = "1551-6857",
publisher = "Association for Computing Machinery (ACM)",
number = "2",

}

TY - JOUR

T1 - On the accuracy and complexity of rate-distortion models for fine-grained scalable video sequences

AU - Hsu, Cheng Hsin

AU - Hefeeda, Mohamed

PY - 2008/5/1

Y1 - 2008/5/1

AB - Rate-distortion (R-D) models are functions that describe the relationship between the bitrate and expected level of distortion in the reconstructed video stream. R-D models enable optimization of the received video quality in different network conditions. Several R-D models have been proposed for the increasingly popular fine-grained scalable video sequences. However, the models' relative performance has not been thoroughly analyzed. Moreover, the time complexity of each model is not known, nor is the range of bitrates in which the model produces valid results. This lack of quantitative performance analysis makes it difficult to select the model that best suits a target streaming system. In this article, we classify, analyze, and rigorously evaluate all R-D models proposed for FGS coders in the literature. We classify R-D models into three categories: analytic, empirical, and semi-analytic. We describe the characteristics of each category. We analyze the R-D models by following their mathematical derivations, scrutinizing the assumptions made, and explaining when the assumptions fail and why. In addition, we implement all R-D models, a total of eight, and evaluate them using a diverse set of video sequences. In our evaluation, we consider various source characteristics, diverse channel conditions, different encoding/decoding parameters, different frame types, and several performance metrics including accuracy, range of applicability, and time complexity of each model. We also present clear systematic ways (pseudo codes) for constructing various R-D models from a given video sequence. Based on our experimental results, we present a justified list of recommendations on selecting the best R-D models for video-on-demand, video conferencing, real-time, and peer-to-peer streaming systems.

KW - Fine-grained scalable coding

KW - Multimedia streaming

KW - Rate-distortion models

UR - http://www.scopus.com/inward/record.url?scp=44249090915&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=44249090915&partnerID=8YFLogxK

U2 - 10.1145/1352012.1352019

DO - 10.1145/1352012.1352019

M3 - Article

AN - SCOPUS:44249090915

VL - 4

JO - ACM Transactions on Multimedia Computing, Communications and Applications

JF - ACM Transactions on Multimedia Computing, Communications and Applications

SN - 1551-6857

IS - 2

M1 - 15

ER -