Penguins in sweaters, or serendipitous entity search on user-generated content

Ilaria Bordino, Yelena Mejova, Mounia Lalmas

Research output: Chapter in Book/Report/Conference proceedingConference contribution

22 Citations (Scopus)

Abstract

In many cases, when browsing the Web users are searching for specific information or answers to concrete questions. Sometimes, though, users find unexpected, yet interesting and useful results, and are encouraged to explore further. What makes a result serendipitous? We propose to answer this question by exploring the potential of entities extracted from two sources of user-generated content - Wikipedia, a user-curated online encyclopedia, and Yahoo! Answers, a more unconstrained question/answering forum - in promoting serendipitous search. In this work, the content of each data source is represented as an entity network, which is further enriched with metadata about sentiment, writing quality, and topical category. We devise an algorithm based on lazy random walk with restart to retrieve entity recommendations from the networks. We show that our method provides novel results from both datasets, compared to standard web search engines. However, unlike previous research, we find that choosing highly emotional entities does not increase user interest for many categories of entities, suggesting a more complex relationship between topic matter and the desirable metadata attributes in serendipitous search.

Original languageEnglish
Title of host publicationInternational Conference on Information and Knowledge Management, Proceedings
Pages109-118
Number of pages10
DOIs
Publication statusPublished - 11 Dec 2013
Externally publishedYes
Event22nd ACM International Conference on Information and Knowledge Management, CIKM 2013 - San Francisco, CA, United States
Duration: 27 Oct 20131 Nov 2013

Other

Other22nd ACM International Conference on Information and Knowledge Management, CIKM 2013
CountryUnited States
CitySan Francisco, CA
Period27/10/131/11/13

Fingerprint

User-generated content
Metadata
Search engine
Emotion
Sentiment
Wikipedia
Web search
Question answering
World Wide Web
Data sources
Random walk

Keywords

  • Entity networks
  • Entity search
  • Interesting-ness
  • Metadata
  • Serendipity

ASJC Scopus subject areas

  • Business, Management and Accounting(all)
  • Decision Sciences(all)

Cite this

Bordino, I., Mejova, Y., & Lalmas, M. (2013). Penguins in sweaters, or serendipitous entity search on user-generated content. In International Conference on Information and Knowledge Management, Proceedings (pp. 109-118) https://doi.org/10.1145/2505515.2505680

Penguins in sweaters, or serendipitous entity search on user-generated content. / Bordino, Ilaria; Mejova, Yelena; Lalmas, Mounia.

International Conference on Information and Knowledge Management, Proceedings. 2013. p. 109-118.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Bordino, I, Mejova, Y & Lalmas, M 2013, Penguins in sweaters, or serendipitous entity search on user-generated content. in International Conference on Information and Knowledge Management, Proceedings. pp. 109-118, 22nd ACM International Conference on Information and Knowledge Management, CIKM 2013, San Francisco, CA, United States, 27/10/13. https://doi.org/10.1145/2505515.2505680
Bordino I, Mejova Y, Lalmas M. Penguins in sweaters, or serendipitous entity search on user-generated content. In International Conference on Information and Knowledge Management, Proceedings. 2013. p. 109-118 https://doi.org/10.1145/2505515.2505680
Bordino, Ilaria ; Mejova, Yelena ; Lalmas, Mounia. / Penguins in sweaters, or serendipitous entity search on user-generated content. International Conference on Information and Knowledge Management, Proceedings. 2013. pp. 109-118
@inproceedings{1e95a38ca90b40748ddde2d86d581a19,
title = "Penguins in sweaters, or serendipitous entity search on user-generated content",
abstract = "In many cases, when browsing the Web users are searching for specific information or answers to concrete questions. Sometimes, though, users find unexpected, yet interesting and useful results, and are encouraged to explore further. What makes a result serendipitous? We propose to answer this question by exploring the potential of entities extracted from two sources of user-generated content - Wikipedia, a user-curated online encyclopedia, and Yahoo! Answers, a more unconstrained question/answering forum - in promoting serendipitous search. In this work, the content of each data source is represented as an entity network, which is further enriched with metadata about sentiment, writing quality, and topical category. We devise an algorithm based on lazy random walk with restart to retrieve entity recommendations from the networks. We show that our method provides novel results from both datasets, compared to standard web search engines. However, unlike previous research, we find that choosing highly emotional entities does not increase user interest for many categories of entities, suggesting a more complex relationship between topic matter and the desirable metadata attributes in serendipitous search.",
keywords = "Entity networks, Entity search, Interesting-ness, Metadata, Serendipity",
author = "Ilaria Bordino and Yelena Mejova and Mounia Lalmas",
year = "2013",
month = "12",
day = "11",
doi = "10.1145/2505515.2505680",
language = "English",
isbn = "9781450322638",
pages = "109--118",
booktitle = "International Conference on Information and Knowledge Management, Proceedings",

}

TY - GEN

T1 - Penguins in sweaters, or serendipitous entity search on user-generated content

AU - Bordino, Ilaria

AU - Mejova, Yelena

AU - Lalmas, Mounia

PY - 2013/12/11

Y1 - 2013/12/11

N2 - In many cases, when browsing the Web users are searching for specific information or answers to concrete questions. Sometimes, though, users find unexpected, yet interesting and useful results, and are encouraged to explore further. What makes a result serendipitous? We propose to answer this question by exploring the potential of entities extracted from two sources of user-generated content - Wikipedia, a user-curated online encyclopedia, and Yahoo! Answers, a more unconstrained question/answering forum - in promoting serendipitous search. In this work, the content of each data source is represented as an entity network, which is further enriched with metadata about sentiment, writing quality, and topical category. We devise an algorithm based on lazy random walk with restart to retrieve entity recommendations from the networks. We show that our method provides novel results from both datasets, compared to standard web search engines. However, unlike previous research, we find that choosing highly emotional entities does not increase user interest for many categories of entities, suggesting a more complex relationship between topic matter and the desirable metadata attributes in serendipitous search.

AB - In many cases, when browsing the Web users are searching for specific information or answers to concrete questions. Sometimes, though, users find unexpected, yet interesting and useful results, and are encouraged to explore further. What makes a result serendipitous? We propose to answer this question by exploring the potential of entities extracted from two sources of user-generated content - Wikipedia, a user-curated online encyclopedia, and Yahoo! Answers, a more unconstrained question/answering forum - in promoting serendipitous search. In this work, the content of each data source is represented as an entity network, which is further enriched with metadata about sentiment, writing quality, and topical category. We devise an algorithm based on lazy random walk with restart to retrieve entity recommendations from the networks. We show that our method provides novel results from both datasets, compared to standard web search engines. However, unlike previous research, we find that choosing highly emotional entities does not increase user interest for many categories of entities, suggesting a more complex relationship between topic matter and the desirable metadata attributes in serendipitous search.

KW - Entity networks

KW - Entity search

KW - Interesting-ness

KW - Metadata

KW - Serendipity

UR - http://www.scopus.com/inward/record.url?scp=84889583799&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84889583799&partnerID=8YFLogxK

U2 - 10.1145/2505515.2505680

DO - 10.1145/2505515.2505680

M3 - Conference contribution

SN - 9781450322638

SP - 109

EP - 118

BT - International Conference on Information and Knowledge Management, Proceedings

ER -