Experiments in authorship-link ranking and complete author clustering

Valentin Zmiycharov, Dimitar Alexandrov, Hristo Georgiev, Yasen Kiprov, Georgi Georgiev, Ivan Koychev, Preslav Nakov

4 Citations (Scopus)


This paper presents the approach we developed for the Authorship Link Ranking and Complete Author Clustering task at the PAN 2016 competition. Given a document collection, the task is to group documents written by the same author, so that each cluster corresponds to a different author. The task can also be viewed as one of establishing authorship links between pairs of documents. We use a combination of classification and agglomerative clustering with a rich set of features, such as average sentence length, function word ratio, type-token ratio, and part-of-speech tags.
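
To make the two views of the task concrete, the following is a minimal sketch and not the system described in the paper. It assumes a tiny hand-picked function word list, plain Euclidean distance on unscaled features, average-linkage merging, and a distance threshold as the stopping rule; all function names (stylometric_features, rank_links, agglomerative_clusters) are hypothetical. It omits the part-of-speech features and the classification step entirely.

```python
import math
import re

# Illustrative function word list (an assumption, not the list used in the paper).
FUNCTION_WORDS = {"the", "a", "an", "of", "and", "to", "in", "that",
                  "it", "is", "was", "but", "or", "on", "with"}


def stylometric_features(text):
    """Return [average sentence length, function word ratio, type-token ratio]."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens or not sentences:
        return [0.0, 0.0, 0.0]
    avg_sentence_len = len(tokens) / len(sentences)
    function_ratio = sum(t in FUNCTION_WORDS for t in tokens) / len(tokens)
    type_token_ratio = len(set(tokens)) / len(tokens)
    return [avg_sentence_len, function_ratio, type_token_ratio]


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def rank_links(vectors):
    """Rank document pairs (i, j) from most to least similar (smallest distance first)."""
    pairs = [(euclidean(vectors[i], vectors[j]), i, j)
             for i in range(len(vectors))
             for j in range(i + 1, len(vectors))]
    return [(i, j) for _, i, j in sorted(pairs)]


def agglomerative_clusters(vectors, threshold):
    """Average-linkage agglomerative clustering.

    Repeatedly merges the two closest clusters and stops once the closest
    pair of clusters is farther apart than `threshold`.
    """
    clusters = [[i] for i in range(len(vectors))]

    def linkage(c1, c2):
        total = sum(euclidean(vectors[i], vectors[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        dist, a, b = min((linkage(clusters[a], clusters[b]), a, b)
                         for a in range(len(clusters))
                         for b in range(a + 1, len(clusters)))
        if dist > threshold:
            break
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]
```

In this sketch, each document would be mapped to a vector with stylometric_features, after which rank_links gives the link-ranking view and agglomerative_clusters gives the clustering view. Because average sentence length is on a much larger scale than the two ratios, a practical version would likely scale the features first.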

Original language: English
Pages (from-to): 1018-1023
Number of pages: 6
Journal: CEUR Workshop Proceedings
Publication status: Published - 1 Jan 2016


ASJC Scopus subject areas

  • Computer Science (all)

