Multilingual document classification via transductive learning

Salvatore Romeo, Dino Ienco, Andrea Tagarelli

Research output: Contribution to journalConference article

1 Citation (Scopus)

Abstract

We present a transductive learning based framework for multilingual document classification, originally proposed in [7]. A key aspect in our approach is the use of a large-scale multilingual knowledge base, BabelNet, to support the modeling of different language-written documents into a common conceptual space, without requiring any language translation process. Results on real-world multilingual corpora have highlighted the superiority of the proposed document model against existing language-dependent representation approaches, and the significance of the transductive setting for multilingual document classification.

Original languageEnglish
JournalCEUR Workshop Proceedings
Volume1404
Publication statusPublished - 1 Jan 2015
Externally publishedYes

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Multilingual document classification via transductive learning. / Romeo, Salvatore; Ienco, Dino; Tagarelli, Andrea.

In: CEUR Workshop Proceedings, Vol. 1404, 01.01.2015.

Research output: Contribution to journalConference article

@article{4e32b35c582748b8b28a6e1b3daf206c,
title = "Multilingual document classification via transductive learning",
abstract = "We present a transductive learning based framework for multilingual document classification, originally proposed in [7]. A key aspect in our approach is the use of a large-scale multilingual knowledge base, BabelNet, to support the modeling of different language-written documents into a common conceptual space, without requiring any language translation process. Results on real-world multilingual corpora have highlighted the superiority of the proposed document model against existing language-dependent representation approaches, and the significance of the transductive setting for multilingual document classification.",
author = "Salvatore Romeo and Dino Ienco and Andrea Tagarelli",
year = "2015",
month = "1",
day = "1",
language = "English",
volume = "1404",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",
publisher = "CEUR-WS",

}

TY - JOUR

T1 - Multilingual document classification via transductive learning

AU - Romeo, Salvatore

AU - Ienco, Dino

AU - Tagarelli, Andrea

PY - 2015/1/1

Y1 - 2015/1/1

N2 - We present a transductive learning based framework for multilingual document classification, originally proposed in [7]. A key aspect in our approach is the use of a large-scale multilingual knowledge base, BabelNet, to support the modeling of different language-written documents into a common conceptual space, without requiring any language translation process. Results on real-world multilingual corpora have highlighted the superiority of the proposed document model against existing language-dependent representation approaches, and the significance of the transductive setting for multilingual document classification.

AB - We present a transductive learning based framework for multilingual document classification, originally proposed in [7]. A key aspect in our approach is the use of a large-scale multilingual knowledge base, BabelNet, to support the modeling of different language-written documents into a common conceptual space, without requiring any language translation process. Results on real-world multilingual corpora have highlighted the superiority of the proposed document model against existing language-dependent representation approaches, and the significance of the transductive setting for multilingual document classification.

UR - http://www.scopus.com/inward/record.url?scp=84938519327&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84938519327&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:84938519327

VL - 1404

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

ER -