Imaginary people representing real numbers

Generating personas from online social media data

J. An, H. Kwak, S. Jung, J. Salminen, M. Admad, Bernard Jansen

Research output: Contribution to journalArticle

7 Citations (Scopus)

Abstract

We develop a methodology to automate creating imaginary people, referred to as personas, by processing complex behavioral and demographic data of social media audiences. From a popular social media account containing more than 30 million interactions by viewers from 198 countries engaging with more than 4,200 online videos produced by a global media corporation, we demonstrate that our methodology has several novel accomplishments, including: (a) identifying distinct user behavioral segments based on the user content consumption patterns; (b) identifying impactful demographics groupings; and (c) creating rich persona descriptions by automatically adding pertinent attributes, such as names, photos, and personal characteristics. We validate our approach by implementing the methodology into an actual working system; we then evaluate it via quantitative methods by examining the accuracy of predicting content preference of personas, the stability of the personas over time, and the generalizability of the method via applying to two other datasets. Research findings show the approach can develop rich personas representing the behavior and demographics of real audiences using privacy-preserving aggregated online social media data from major online platforms. Results have implications for media companies and other organizations distributing content via online platforms.

Original languageEnglish
Article number27
JournalACM Transactions on the Web
Volume12
Issue number4
DOIs
Publication statusPublished - 1 Nov 2018

Fingerprint

Industry
Processing

Keywords

  • Persona
  • User analytics

ASJC Scopus subject areas

  • Computer Networks and Communications

Cite this

Imaginary people representing real numbers : Generating personas from online social media data. / An, J.; Kwak, H.; Jung, S.; Salminen, J.; Admad, M.; Jansen, Bernard.

In: ACM Transactions on the Web, Vol. 12, No. 4, 27, 01.11.2018.

Research output: Contribution to journalArticle

@article{2ff01e6a3dc347408dcca3f31f64f62d,
title = "Imaginary people representing real numbers: Generating personas from online social media data",
abstract = "We develop a methodology to automate creating imaginary people, referred to as personas, by processing complex behavioral and demographic data of social media audiences. From a popular social media account containing more than 30 million interactions by viewers from 198 countries engaging with more than 4,200 online videos produced by a global media corporation, we demonstrate that our methodology has several novel accomplishments, including: (a) identifying distinct user behavioral segments based on the user content consumption patterns; (b) identifying impactful demographics groupings; and (c) creating rich persona descriptions by automatically adding pertinent attributes, such as names, photos, and personal characteristics. We validate our approach by implementing the methodology into an actual working system; we then evaluate it via quantitative methods by examining the accuracy of predicting content preference of personas, the stability of the personas over time, and the generalizability of the method via applying to two other datasets. Research findings show the approach can develop rich personas representing the behavior and demographics of real audiences using privacy-preserving aggregated online social media data from major online platforms. Results have implications for media companies and other organizations distributing content via online platforms.",
keywords = "Persona, User analytics",
author = "J. An and H. Kwak and S. Jung and J. Salminen and M. Admad and Bernard Jansen",
year = "2018",
month = "11",
day = "1",
doi = "10.1145/3265986",
language = "English",
volume = "12",
journal = "ACM Transactions on the Web",
issn = "1559-1131",
publisher = "Association for Computing Machinery (ACM)",
number = "4",

}

TY - JOUR

T1 - Imaginary people representing real numbers

T2 - Generating personas from online social media data

AU - An, J.

AU - Kwak, H.

AU - Jung, S.

AU - Salminen, J.

AU - Admad, M.

AU - Jansen, Bernard

PY - 2018/11/1

Y1 - 2018/11/1

N2 - We develop a methodology to automate creating imaginary people, referred to as personas, by processing complex behavioral and demographic data of social media audiences. From a popular social media account containing more than 30 million interactions by viewers from 198 countries engaging with more than 4,200 online videos produced by a global media corporation, we demonstrate that our methodology has several novel accomplishments, including: (a) identifying distinct user behavioral segments based on the user content consumption patterns; (b) identifying impactful demographics groupings; and (c) creating rich persona descriptions by automatically adding pertinent attributes, such as names, photos, and personal characteristics. We validate our approach by implementing the methodology into an actual working system; we then evaluate it via quantitative methods by examining the accuracy of predicting content preference of personas, the stability of the personas over time, and the generalizability of the method via applying to two other datasets. Research findings show the approach can develop rich personas representing the behavior and demographics of real audiences using privacy-preserving aggregated online social media data from major online platforms. Results have implications for media companies and other organizations distributing content via online platforms.

AB - We develop a methodology to automate creating imaginary people, referred to as personas, by processing complex behavioral and demographic data of social media audiences. From a popular social media account containing more than 30 million interactions by viewers from 198 countries engaging with more than 4,200 online videos produced by a global media corporation, we demonstrate that our methodology has several novel accomplishments, including: (a) identifying distinct user behavioral segments based on the user content consumption patterns; (b) identifying impactful demographics groupings; and (c) creating rich persona descriptions by automatically adding pertinent attributes, such as names, photos, and personal characteristics. We validate our approach by implementing the methodology into an actual working system; we then evaluate it via quantitative methods by examining the accuracy of predicting content preference of personas, the stability of the personas over time, and the generalizability of the method via applying to two other datasets. Research findings show the approach can develop rich personas representing the behavior and demographics of real audiences using privacy-preserving aggregated online social media data from major online platforms. Results have implications for media companies and other organizations distributing content via online platforms.

KW - Persona

KW - User analytics

UR - http://www.scopus.com/inward/record.url?scp=85060866682&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85060866682&partnerID=8YFLogxK

U2 - 10.1145/3265986

DO - 10.1145/3265986

M3 - Article

VL - 12

JO - ACM Transactions on the Web

JF - ACM Transactions on the Web

SN - 1559-1131

IS - 4

M1 - 27

ER -