Joint topic modeling for event summarization across news and social media streams

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Abstract

Social media streams such as Twitter are regarded as faster, first-hand sources of information generated by large numbers of users. The content diffused through this channel, although noisy, provides an important complement to, and sometimes even a substitute for, traditional news media reporting. In this chapter, we describe a novel unsupervised approach based on topic modeling that summarizes trending subjects by jointly discovering representative and complementary information from news and tweets. Our method captures content that enriches the subject matter by reinforcing the identification of complementary sentence-tweet pairs. To evaluate the complementarity of a pair, we leverage the topic modeling formalism, combining a two-dimensional topic-aspect model with a cross-collection approach from the multi-document summarization literature. The final summaries are generated by co-ranking the news sentences and tweets on both sides simultaneously. Experiments give promising results compared with state-of-the-art baselines.

Original language: English
Title of host publication: Social Media Content Analysis
Subtitle of host publication: Natural Language Processing and Beyond
Publisher: World Scientific Publishing Co. Pte Ltd
Pages: 321-346
Number of pages: 26
ISBN (Electronic): 9789813223615
ISBN (Print): 9789813223608
DOI: https://doi.org/10.1142/9789813223615_0022
Publication status: Published - 1 Jan 2017

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Gao, W., Li, P., & Darwish, K. (2017). Joint topic modeling for event summarization across news and social media streams. In Social Media Content Analysis: Natural Language Processing and Beyond (pp. 321-346). World Scientific Publishing Co. Pte Ltd. https://doi.org/10.1142/9789813223615_0022