Extracting skill endorsements from personal communication data

Darshan M Shankara Lingappa, Gianmarco Morales, Aristides Gionis

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

People are increasingly communicating and collaborating via digital platforms, such as email and messaging applications. Data exchanged on these digital communication platforms can be a treasure trove of information on people who participate in the discussions: who they are collaborating with, what they are working on, what their expertise is, and so on. Yet, personal communication data is very rarely analyzed due to the sensitivity of the information it contains. In this paper, we mine personal communication data with the goal of generating skill endorsements of the type "person A endorses person B on skill X." To address privacy concerns, we consider that each person has access only to their own data (i.e., conversations with their peers). By using our method, they can generate endorsements for their peers, which they can inspect and opt to publish. To identify meaningful skills we use a knowledge base created from the StackExchange q&a forum. We study two different approaches, one based on building a skill graph, and one based on information retrieval techniques. We find that the latter approach outperforms the graph-based algorithms when tested on a dataset of user profiles from StackOverflow. We also conduct a user study on email data and find that the information retrieval-based approach achieves a MAP@10 score of 0.617.

Original languageEnglish
Title of host publicationCIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management
PublisherAssociation for Computing Machinery
Pages1961-1964
Number of pages4
Volume24-28-October-2016
ISBN (Electronic)9781450340731
DOIs
Publication statusPublished - 24 Oct 2016
Event25th ACM International Conference on Information and Knowledge Management, CIKM 2016 - Indianapolis, United States
Duration: 24 Oct 201628 Oct 2016

Other

Other25th ACM International Conference on Information and Knowledge Management, CIKM 2016
CountryUnited States
CityIndianapolis
Period24/10/1628/10/16

Fingerprint

Endorsements
Communication
Peers
Electronic mail
Graph
Information retrieval
Knowledge base
Privacy concerns
User profile
Expertise
User studies

Keywords

  • E-mail mining
  • Personal data
  • Skill endorsements

ASJC Scopus subject areas

  • Business, Management and Accounting(all)
  • Decision Sciences(all)

Cite this

Lingappa, D. M. S., Morales, G., & Gionis, A. (2016). Extracting skill endorsements from personal communication data. In CIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management (Vol. 24-28-October-2016, pp. 1961-1964). Association for Computing Machinery. https://doi.org/10.1145/2983323.2983884

Extracting skill endorsements from personal communication data. / Lingappa, Darshan M Shankara; Morales, Gianmarco; Gionis, Aristides.

CIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management. Vol. 24-28-October-2016 Association for Computing Machinery, 2016. p. 1961-1964.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lingappa, DMS, Morales, G & Gionis, A 2016, Extracting skill endorsements from personal communication data. in CIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management. vol. 24-28-October-2016, Association for Computing Machinery, pp. 1961-1964, 25th ACM International Conference on Information and Knowledge Management, CIKM 2016, Indianapolis, United States, 24/10/16. https://doi.org/10.1145/2983323.2983884
Lingappa DMS, Morales G, Gionis A. Extracting skill endorsements from personal communication data. In CIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management. Vol. 24-28-October-2016. Association for Computing Machinery. 2016. p. 1961-1964 https://doi.org/10.1145/2983323.2983884
Lingappa, Darshan M Shankara ; Morales, Gianmarco ; Gionis, Aristides. / Extracting skill endorsements from personal communication data. CIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management. Vol. 24-28-October-2016 Association for Computing Machinery, 2016. pp. 1961-1964
@inproceedings{352b59ec848248848b860181cb971d1d,
title = "Extracting skill endorsements from personal communication data",
abstract = "People are increasingly communicating and collaborating via digital platforms, such as email and messaging applications. Data exchanged on these digital communication platforms can be a treasure trove of information on people who participate in the discussions: who they are collaborating with, what they are working on, what their expertise is, and so on. Yet, personal communication data is very rarely analyzed due to the sensitivity of the information it contains. In this paper, we mine personal communication data with the goal of generating skill endorsements of the type {"}person A endorses person B on skill X.{"} To address privacy concerns, we consider that each person has access only to their own data (i.e., conversations with their peers). By using our method, they can generate endorsements for their peers, which they can inspect and opt to publish. To identify meaningful skills we use a knowledge base created from the StackExchange q&a forum. We study two different approaches, one based on building a skill graph, and one based on information retrieval techniques. We find that the latter approach outperforms the graph-based algorithms when tested on a dataset of user profiles from StackOverflow. We also conduct a user study on email data and find that the information retrieval-based approach achieves a MAP@10 score of 0.617.",
keywords = "E-mail mining, Personal data, Skill endorsements",
author = "Lingappa, {Darshan M Shankara} and Gianmarco Morales and Aristides Gionis",
year = "2016",
month = "10",
day = "24",
doi = "10.1145/2983323.2983884",
language = "English",
volume = "24-28-October-2016",
pages = "1961--1964",
booktitle = "CIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management",
publisher = "Association for Computing Machinery",

}

TY - GEN

T1 - Extracting skill endorsements from personal communication data

AU - Lingappa, Darshan M Shankara

AU - Morales, Gianmarco

AU - Gionis, Aristides

PY - 2016/10/24

Y1 - 2016/10/24

N2 - People are increasingly communicating and collaborating via digital platforms, such as email and messaging applications. Data exchanged on these digital communication platforms can be a treasure trove of information on people who participate in the discussions: who they are collaborating with, what they are working on, what their expertise is, and so on. Yet, personal communication data is very rarely analyzed due to the sensitivity of the information it contains. In this paper, we mine personal communication data with the goal of generating skill endorsements of the type "person A endorses person B on skill X." To address privacy concerns, we consider that each person has access only to their own data (i.e., conversations with their peers). By using our method, they can generate endorsements for their peers, which they can inspect and opt to publish. To identify meaningful skills we use a knowledge base created from the StackExchange q&a forum. We study two different approaches, one based on building a skill graph, and one based on information retrieval techniques. We find that the latter approach outperforms the graph-based algorithms when tested on a dataset of user profiles from StackOverflow. We also conduct a user study on email data and find that the information retrieval-based approach achieves a MAP@10 score of 0.617.

AB - People are increasingly communicating and collaborating via digital platforms, such as email and messaging applications. Data exchanged on these digital communication platforms can be a treasure trove of information on people who participate in the discussions: who they are collaborating with, what they are working on, what their expertise is, and so on. Yet, personal communication data is very rarely analyzed due to the sensitivity of the information it contains. In this paper, we mine personal communication data with the goal of generating skill endorsements of the type "person A endorses person B on skill X." To address privacy concerns, we consider that each person has access only to their own data (i.e., conversations with their peers). By using our method, they can generate endorsements for their peers, which they can inspect and opt to publish. To identify meaningful skills we use a knowledge base created from the StackExchange q&a forum. We study two different approaches, one based on building a skill graph, and one based on information retrieval techniques. We find that the latter approach outperforms the graph-based algorithms when tested on a dataset of user profiles from StackOverflow. We also conduct a user study on email data and find that the information retrieval-based approach achieves a MAP@10 score of 0.617.

KW - E-mail mining

KW - Personal data

KW - Skill endorsements

UR - http://www.scopus.com/inward/record.url?scp=84996538828&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84996538828&partnerID=8YFLogxK

U2 - 10.1145/2983323.2983884

DO - 10.1145/2983323.2983884

M3 - Conference contribution

VL - 24-28-October-2016

SP - 1961

EP - 1964

BT - CIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management

PB - Association for Computing Machinery

ER -