Experiments in authorship-link ranking and complete author clustering

Valentin Zmiycharov, Dimitar Alexandrov, Hristo Georgiev, Yasen Kiprov, Georgi Georgiev, Ivan Koychev, Preslav Nakov

Research output: Contribution to journalConference article

4 Citations (Scopus)

Abstract

The paper presents the approach we developed for the Authorship Link Ranking and Complete Author Clustering task at the PAN 2016 competition. Given a document collection, the task is to group documents written by the same author, so that each cluster corresponds to a different author. This task can also be viewed as one of establishing authorship links between documents. We use a combination of classification and agglomerative clustering with a rich set of features such as average sentence length, function words ratio, type-Token ratio and part of speech tags.

Original languageEnglish
Pages (from-to)1018-1023
Number of pages6
JournalCEUR Workshop Proceedings
Volume1609
Publication statusPublished - 1 Jan 2016

    Fingerprint

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Zmiycharov, V., Alexandrov, D., Georgiev, H., Kiprov, Y., Georgiev, G., Koychev, I., & Nakov, P. (2016). Experiments in authorship-link ranking and complete author clustering. CEUR Workshop Proceedings, 1609, 1018-1023.