PAN@FIRE: Overview of the cross-language !ndian text re-use detection competition

Alberto Barròn-Cedeño, Paolo Rosso, Sobha Lalitha Devi, Paul Clough, Mark Stevenson

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

The development of models for automatic detection of text re-use and plagiarism across languages has received increasing attention in recent years. However, the lack of an evaluation framework composed of annotated datasets has caused these efforts to be isolated. In this paper we present the CL!TR 2011 corpus, the first manually created corpus for the analysis of cross-language text re-use between English and Hindi. The corpus was used during the Cross-Language !ndian Text Re-Use Detection Competition. Here we overview the approaches applied the contestants and evaluate their quality when detecting a re-used text together with its source.

Original languageEnglish
Title of host publicationMultilingual Information Access in South Asian Languages - Second International Workshop, FIRE 2010 and Third International Workshop, FIRE 2011, Revised Selected Papers
Pages59-70
Number of pages12
DOIs
Publication statusPublished - 1 Dec 2013
Event3rd International Workshop on Multilingual Information Access in South Asian Languages, FIRE 2011 - Bombay, India
Duration: 2 Dec 20114 Dec 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7536 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference3rd International Workshop on Multilingual Information Access in South Asian Languages, FIRE 2011
CountryIndia
CityBombay
Period2/12/114/12/11

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'PAN@FIRE: Overview of the cross-language !ndian text re-use detection competition'. Together they form a unique fingerprint.

  • Cite this

    Barròn-Cedeño, A., Rosso, P., Devi, S. L., Clough, P., & Stevenson, M. (2013). PAN@FIRE: Overview of the cross-language !ndian text re-use detection competition. In Multilingual Information Access in South Asian Languages - Second International Workshop, FIRE 2010 and Third International Workshop, FIRE 2011, Revised Selected Papers (pp. 59-70). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7536 LNCS). https://doi.org/10.1007/978-3-642-40087-2_6