SU@PAN'2016

Author obfuscation

Tsvetomila Mihaylova, Georgi Karadjov, Yasen Kiprov, Georgi Georgiev, Ivan Koychev, Preslav Nakov

Research output: Contribution to journalConference article

2 Citations (Scopus)

Abstract

The anonymity of a text's writer is an important topic for some domains, such as witness protection and anonymity programs. Stylometry can be used to reveal the true author of a text even if s/he wishes to hide his/her identity. In this paper, we present our approach for hiding an author's identity by masking their style, which we developed for the Author Obfuscation task, part of the PAN-2016 competition. The approach consists of three main steps: The first one is an evaluation of different metrics in the text that can indicate authorship; the second one is application of various transformations, so that those metrics of the target text are adjusted towards the average level, while still keeping the meaning and the soundness of the text; as a final step, we are adding random noise to the text. Our system showed the best performance for masking the author style.

Original languageEnglish
Pages (from-to)956-969
Number of pages14
JournalCEUR Workshop Proceedings
Volume1609
Publication statusPublished - 1 Jan 2016

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Mihaylova, T., Karadjov, G., Kiprov, Y., Georgiev, G., Koychev, I., & Nakov, P. (2016). SU@PAN'2016: Author obfuscation. CEUR Workshop Proceedings, 1609, 956-969.

SU@PAN'2016 : Author obfuscation. / Mihaylova, Tsvetomila; Karadjov, Georgi; Kiprov, Yasen; Georgiev, Georgi; Koychev, Ivan; Nakov, Preslav.

In: CEUR Workshop Proceedings, Vol. 1609, 01.01.2016, p. 956-969.

Research output: Contribution to journalConference article

Mihaylova, T, Karadjov, G, Kiprov, Y, Georgiev, G, Koychev, I & Nakov, P 2016, 'SU@PAN'2016: Author obfuscation', CEUR Workshop Proceedings, vol. 1609, pp. 956-969.
Mihaylova T, Karadjov G, Kiprov Y, Georgiev G, Koychev I, Nakov P. SU@PAN'2016: Author obfuscation. CEUR Workshop Proceedings. 2016 Jan 1;1609:956-969.
Mihaylova, Tsvetomila ; Karadjov, Georgi ; Kiprov, Yasen ; Georgiev, Georgi ; Koychev, Ivan ; Nakov, Preslav. / SU@PAN'2016 : Author obfuscation. In: CEUR Workshop Proceedings. 2016 ; Vol. 1609. pp. 956-969.
@article{72e972df170945ecbf1502de25cc734c,
title = "SU@PAN'2016: Author obfuscation",
abstract = "The anonymity of a text's writer is an important topic for some domains, such as witness protection and anonymity programs. Stylometry can be used to reveal the true author of a text even if s/he wishes to hide his/her identity. In this paper, we present our approach for hiding an author's identity by masking their style, which we developed for the Author Obfuscation task, part of the PAN-2016 competition. The approach consists of three main steps: The first one is an evaluation of different metrics in the text that can indicate authorship; the second one is application of various transformations, so that those metrics of the target text are adjusted towards the average level, while still keeping the meaning and the soundness of the text; as a final step, we are adding random noise to the text. Our system showed the best performance for masking the author style.",
author = "Tsvetomila Mihaylova and Georgi Karadjov and Yasen Kiprov and Georgi Georgiev and Ivan Koychev and Preslav Nakov",
year = "2016",
month = "1",
day = "1",
language = "English",
volume = "1609",
pages = "956--969",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",
publisher = "CEUR-WS",

}

TY - JOUR

T1 - SU@PAN'2016

T2 - Author obfuscation

AU - Mihaylova, Tsvetomila

AU - Karadjov, Georgi

AU - Kiprov, Yasen

AU - Georgiev, Georgi

AU - Koychev, Ivan

AU - Nakov, Preslav

PY - 2016/1/1

Y1 - 2016/1/1

N2 - The anonymity of a text's writer is an important topic for some domains, such as witness protection and anonymity programs. Stylometry can be used to reveal the true author of a text even if s/he wishes to hide his/her identity. In this paper, we present our approach for hiding an author's identity by masking their style, which we developed for the Author Obfuscation task, part of the PAN-2016 competition. The approach consists of three main steps: The first one is an evaluation of different metrics in the text that can indicate authorship; the second one is application of various transformations, so that those metrics of the target text are adjusted towards the average level, while still keeping the meaning and the soundness of the text; as a final step, we are adding random noise to the text. Our system showed the best performance for masking the author style.

AB - The anonymity of a text's writer is an important topic for some domains, such as witness protection and anonymity programs. Stylometry can be used to reveal the true author of a text even if s/he wishes to hide his/her identity. In this paper, we present our approach for hiding an author's identity by masking their style, which we developed for the Author Obfuscation task, part of the PAN-2016 competition. The approach consists of three main steps: The first one is an evaluation of different metrics in the text that can indicate authorship; the second one is application of various transformations, so that those metrics of the target text are adjusted towards the average level, while still keeping the meaning and the soundness of the text; as a final step, we are adding random noise to the text. Our system showed the best performance for masking the author style.

UR - http://www.scopus.com/inward/record.url?scp=85019624755&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85019624755&partnerID=8YFLogxK

M3 - Conference article

VL - 1609

SP - 956

EP - 969

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

ER -