Experiments in authorship-link ranking and complete author clustering

Valentin Zmiycharov, Dimitar Alexandrov, Hristo Georgiev, Yasen Kiprov, Georgi Georgiev, Ivan Koychev, Preslav Nakov

Research output: Contribution to journal > Conference article

3 Citations (Scopus)

Abstract

The paper presents the approach we developed for the Authorship Link Ranking and Complete Author Clustering task at the PAN 2016 competition. Given a document collection, the task is to group documents written by the same author, so that each cluster corresponds to a different author. This task can also be viewed as one of establishing authorship links between documents. We use a combination of classification and agglomerative clustering with a rich set of features such as average sentence length, function word ratio, type-token ratio, and part-of-speech tags.
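To make the pipeline in the abstract concrete, here is a minimal sketch of three of the named features, a pairwise link ranking, and a threshold-based agglomerative clustering. It is not the authors' implementation. It makes several assumptions: the function-word list is a tiny illustrative one, a plain Euclidean distance over standardized features stands in for the trained classifier, part-of-speech features are left out because they need an external tagger, and all function names and the `threshold` parameter are hypothetical.

```python
import math
import re
from itertools import combinations

# Tiny illustrative function-word list (an assumption, not the paper's list).
FUNCTION_WORDS = {"the", "a", "an", "and", "or", "but", "of", "in", "on",
                  "to", "for", "with", "is", "was", "it", "that", "this"}


def features(text):
    """Return [average sentence length, function-word ratio, type-token ratio]."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens or not sentences:
        return [0.0, 0.0, 0.0]
    avg_sentence_len = len(tokens) / len(sentences)
    function_ratio = sum(t in FUNCTION_WORDS for t in tokens) / len(tokens)
    type_token_ratio = len(set(tokens)) / len(tokens)
    return [avg_sentence_len, function_ratio, type_token_ratio]


def standardize(rows):
    """Z-score each feature column so no single feature dominates distances."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    # A constant column has zero deviation; fall back to 1.0 to avoid dividing by zero.
    stds = [math.sqrt(sum((x - m) ** 2 for x in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    return [[(x - m) / s for x, m, s in zip(row, means, stds)] for row in rows]


def distance(u, v):
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def rank_links(vectors):
    """Score every document pair; higher score = more likely the same author."""
    pairs = combinations(range(len(vectors)), 2)
    scored = [(1.0 / (1.0 + distance(vectors[i], vectors[j])), i, j)
              for i, j in pairs]
    return sorted(scored, reverse=True)


def agglomerative(vectors, threshold):
    """Average-linkage clustering: keep merging the two closest clusters
    until the smallest inter-cluster distance exceeds `threshold`."""
    clusters = [[i] for i in range(len(vectors))]

    def linkage(a, b):
        total = sum(distance(vectors[i], vectors[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > 1:
        d, x, y = min((linkage(clusters[x], clusters[y]), x, y)
                      for x, y in combinations(range(len(clusters)), 2))
        if d > threshold:
            break
        clusters[x] = clusters[x] + clusters[y]  # x < y, so deleting y is safe
        del clusters[y]
    return clusters
```

A typical call chain would be `vectors = standardize([features(d) for d in docs])`, followed by `rank_links(vectors)` for the link-ranking output and `agglomerative(vectors, threshold)` for the clustering output. The threshold controls how many author clusters emerge and would need tuning on training data.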

Original language: English
Pages (from-to): 1018-1023
Number of pages: 6
Journal: CEUR Workshop Proceedings
Volume: 1609
Publication status: Published - 1 Jan 2016

ASJC Scopus subject areas

  • Computer Science (all)

Cite this

Zmiycharov, V., Alexandrov, D., Georgiev, H., Kiprov, Y., Georgiev, G., Koychev, I., & Nakov, P. (2016). Experiments in authorship-link ranking and complete author clustering. CEUR Workshop Proceedings, 1609, 1018-1023.

Experiments in authorship-link ranking and complete author clustering. / Zmiycharov, Valentin; Alexandrov, Dimitar; Georgiev, Hristo; Kiprov, Yasen; Georgiev, Georgi; Koychev, Ivan; Nakov, Preslav.

In: CEUR Workshop Proceedings, Vol. 1609, 01.01.2016, p. 1018-1023.

Zmiycharov, V, Alexandrov, D, Georgiev, H, Kiprov, Y, Georgiev, G, Koychev, I & Nakov, P 2016, 'Experiments in authorship-link ranking and complete author clustering', CEUR Workshop Proceedings, vol. 1609, pp. 1018-1023.

Zmiycharov V, Alexandrov D, Georgiev H, Kiprov Y, Georgiev G, Koychev I et al. Experiments in authorship-link ranking and complete author clustering. CEUR Workshop Proceedings. 2016 Jan 1;1609:1018-1023.

Zmiycharov, Valentin ; Alexandrov, Dimitar ; Georgiev, Hristo ; Kiprov, Yasen ; Georgiev, Georgi ; Koychev, Ivan ; Nakov, Preslav. / Experiments in authorship-link ranking and complete author clustering. In: CEUR Workshop Proceedings. 2016 ; Vol. 1609. pp. 1018-1023.

@article{0df7cab7319648bca8dbc4730037b95b,
title = "Experiments in authorship-link ranking and complete author clustering",
abstract = "The paper presents the approach we developed for the Authorship Link Ranking and Complete Author Clustering task at the PAN 2016 competition. Given a document collection, the task is to group documents written by the same author, so that each cluster corresponds to a different author. This task can also be viewed as one of establishing authorship links between documents. We use a combination of classification and agglomerative clustering with a rich set of features such as average sentence length, function word ratio, type-token ratio, and part-of-speech tags.",
author = "Valentin Zmiycharov and Dimitar Alexandrov and Hristo Georgiev and Yasen Kiprov and Georgi Georgiev and Ivan Koychev and Preslav Nakov",
year = "2016",
month = "1",
day = "1",
language = "English",
volume = "1609",
pages = "1018--1023",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",
publisher = "CEUR-WS",

}

TY  - JOUR
T1  - Experiments in authorship-link ranking and complete author clustering
AU  - Zmiycharov, Valentin
AU  - Alexandrov, Dimitar
AU  - Georgiev, Hristo
AU  - Kiprov, Yasen
AU  - Georgiev, Georgi
AU  - Koychev, Ivan
AU  - Nakov, Preslav
PY  - 2016/1/1
Y1  - 2016/1/1
N2  - The paper presents the approach we developed for the Authorship Link Ranking and Complete Author Clustering task at the PAN 2016 competition. Given a document collection, the task is to group documents written by the same author, so that each cluster corresponds to a different author. This task can also be viewed as one of establishing authorship links between documents. We use a combination of classification and agglomerative clustering with a rich set of features such as average sentence length, function word ratio, type-token ratio, and part-of-speech tags.
AB  - The paper presents the approach we developed for the Authorship Link Ranking and Complete Author Clustering task at the PAN 2016 competition. Given a document collection, the task is to group documents written by the same author, so that each cluster corresponds to a different author. This task can also be viewed as one of establishing authorship links between documents. We use a combination of classification and agglomerative clustering with a rich set of features such as average sentence length, function word ratio, type-token ratio, and part-of-speech tags.
UR  - http://www.scopus.com/inward/record.url?scp=85019613416&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=85019613416&partnerID=8YFLogxK
M3  - Conference article
AN  - SCOPUS:85019613416
VL  - 1609
SP  - 1018
EP  - 1023
JO  - CEUR Workshop Proceedings
JF  - CEUR Workshop Proceedings
SN  - 1613-0073
ER  - 