Text quantification

Fabrizio Sebastiani

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In a number of applications involving text classification in recent years it has been pointed out that the final goal is not determining which class (or classes) individual unlabeled documents belong to, but determining the prevalence (or “relative frequency”) of each class in the unlabeled data. The latter task is known as text quantification (or prevalence estimation, or class prior estimation). The goal of this course was to introduce the audience to the problem of quantification, techniques that have been proposed for solving it, metrics used to evaluate them, applications in fields such as information retrieval, machine learning, and data mining, and to the open problems in the area.

Original languageEnglish
Title of host publicationInformation Retrieval - 9th Russian Summer School, RuSSIR 2015, Revised Selected Papers
PublisherSpringer Verlag
Volume573
ISBN (Print)9783319417172
Publication statusPublished - 2016
Event9th Russian Summer School in Information Retrieval, RuSSIR 2015 - St. Petersburg, Russian Federation
Duration: 24 Aug 201528 Aug 2015

Publication series

NameCommunications in Computer and Information Science
Volume573
ISSN (Print)18650929

Other

Other9th Russian Summer School in Information Retrieval, RuSSIR 2015
CountryRussian Federation
CitySt. Petersburg
Period24/8/1528/8/15

Fingerprint

Information retrieval
Data mining
Learning systems

Keywords

  • Text classification
  • Text quantification

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Sebastiani, F. (2016). Text quantification. In Information Retrieval - 9th Russian Summer School, RuSSIR 2015, Revised Selected Papers (Vol. 573). (Communications in Computer and Information Science; Vol. 573). Springer Verlag.

Text quantification. / Sebastiani, Fabrizio.

Information Retrieval - 9th Russian Summer School, RuSSIR 2015, Revised Selected Papers. Vol. 573 Springer Verlag, 2016. (Communications in Computer and Information Science; Vol. 573).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sebastiani, F 2016, Text quantification. in Information Retrieval - 9th Russian Summer School, RuSSIR 2015, Revised Selected Papers. vol. 573, Communications in Computer and Information Science, vol. 573, Springer Verlag, 9th Russian Summer School in Information Retrieval, RuSSIR 2015, St. Petersburg, Russian Federation, 24/8/15.
Sebastiani F. Text quantification. In Information Retrieval - 9th Russian Summer School, RuSSIR 2015, Revised Selected Papers. Vol. 573. Springer Verlag. 2016. (Communications in Computer and Information Science).
Sebastiani, Fabrizio. / Text quantification. Information Retrieval - 9th Russian Summer School, RuSSIR 2015, Revised Selected Papers. Vol. 573 Springer Verlag, 2016. (Communications in Computer and Information Science).
@inproceedings{48d25c187e9a4864b150c00b9016a49a,
title = "Text quantification",
abstract = "In a number of applications involving text classification in recent years it has been pointed out that the final goal is not determining which class (or classes) individual unlabeled documents belong to, but determining the prevalence (or “relative frequency”) of each class in the unlabeled data. The latter task is known as text quantification (or prevalence estimation, or class prior estimation). The goal of this course was to introduce the audience to the problem of quantification, techniques that have been proposed for solving it, metrics used to evaluate them, applications in fields such as information retrieval, machine learning, and data mining, and to the open problems in the area.",
keywords = "Text classification, Text quantification",
author = "Fabrizio Sebastiani",
year = "2016",
language = "English",
isbn = "9783319417172",
volume = "573",
series = "Communications in Computer and Information Science",
publisher = "Springer Verlag",
booktitle = "Information Retrieval - 9th Russian Summer School, RuSSIR 2015, Revised Selected Papers",

}

TY - GEN

T1 - Text quantification

AU - Sebastiani, Fabrizio

PY - 2016

Y1 - 2016

N2 - In a number of applications involving text classification in recent years it has been pointed out that the final goal is not determining which class (or classes) individual unlabeled documents belong to, but determining the prevalence (or “relative frequency”) of each class in the unlabeled data. The latter task is known as text quantification (or prevalence estimation, or class prior estimation). The goal of this course was to introduce the audience to the problem of quantification, techniques that have been proposed for solving it, metrics used to evaluate them, applications in fields such as information retrieval, machine learning, and data mining, and to the open problems in the area.

AB - In a number of applications involving text classification in recent years it has been pointed out that the final goal is not determining which class (or classes) individual unlabeled documents belong to, but determining the prevalence (or “relative frequency”) of each class in the unlabeled data. The latter task is known as text quantification (or prevalence estimation, or class prior estimation). The goal of this course was to introduce the audience to the problem of quantification, techniques that have been proposed for solving it, metrics used to evaluate them, applications in fields such as information retrieval, machine learning, and data mining, and to the open problems in the area.

KW - Text classification

KW - Text quantification

UR - http://www.scopus.com/inward/record.url?scp=84979765608&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84979765608&partnerID=8YFLogxK

M3 - Conference contribution

SN - 9783319417172

VL - 573

T3 - Communications in Computer and Information Science

BT - Information Retrieval - 9th Russian Summer School, RuSSIR 2015, Revised Selected Papers

PB - Springer Verlag

ER -