Topic extraction from microblog posts using conversation structures

Jing Li, Ming Liao, Wei Gao, Yulan He, Kam Fai Wong

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Citations (Scopus)

Abstract

Conventional topic models are ineffective for topic extraction from microblog messages since the lack of structure and context among the posts renders poor message-level word co-occurrence patterns. In this work, we organize microblog posts as conversation trees based on reposting and replying relations, which enrich context information to alleviate data sparseness. Our model generates words according to topic dependencies derived from the conversation structures. In specific, we differentiate messages as leader messages, which initiate key aspects of previously focused topics or shift the focus to different topics, and follower messages that do not introduce any new information but simply echo topics from the messages that they repost or reply. Our model captures the different extents that leader and follower messages may contain the key topical words, thus further enhances the quality of the induced topics. The results of thorough experiments demonstrate the effectiveness of our proposed model.

Original languageEnglish
Title of host publication54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers
PublisherAssociation for Computational Linguistics (ACL)
Pages2114-2123
Number of pages10
Volume4
ISBN (Electronic)9781510827585
Publication statusPublished - 2016
Event54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Berlin, Germany
Duration: 7 Aug 201612 Aug 2016

Other

Other54th Annual Meeting of the Association for Computational Linguistics, ACL 2016
CountryGermany
CityBerlin
Period7/8/1612/8/16

Fingerprint

conversation
follower
leader
lack
experiment
Follower

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Cite this

Li, J., Liao, M., Gao, W., He, Y., & Wong, K. F. (2016). Topic extraction from microblog posts using conversation structures. In 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers (Vol. 4, pp. 2114-2123). Association for Computational Linguistics (ACL).

Topic extraction from microblog posts using conversation structures. / Li, Jing; Liao, Ming; Gao, Wei; He, Yulan; Wong, Kam Fai.

54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers. Vol. 4 Association for Computational Linguistics (ACL), 2016. p. 2114-2123.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Li, J, Liao, M, Gao, W, He, Y & Wong, KF 2016, Topic extraction from microblog posts using conversation structures. in 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers. vol. 4, Association for Computational Linguistics (ACL), pp. 2114-2123, 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, Berlin, Germany, 7/8/16.
Li J, Liao M, Gao W, He Y, Wong KF. Topic extraction from microblog posts using conversation structures. In 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers. Vol. 4. Association for Computational Linguistics (ACL). 2016. p. 2114-2123
Li, Jing ; Liao, Ming ; Gao, Wei ; He, Yulan ; Wong, Kam Fai. / Topic extraction from microblog posts using conversation structures. 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers. Vol. 4 Association for Computational Linguistics (ACL), 2016. pp. 2114-2123
@inproceedings{7e3e709d62a64ad3b9323b033603eeb8,
title = "Topic extraction from microblog posts using conversation structures",
abstract = "Conventional topic models are ineffective for topic extraction from microblog messages since the lack of structure and context among the posts renders poor message-level word co-occurrence patterns. In this work, we organize microblog posts as conversation trees based on reposting and replying relations, which enrich context information to alleviate data sparseness. Our model generates words according to topic dependencies derived from the conversation structures. In specific, we differentiate messages as leader messages, which initiate key aspects of previously focused topics or shift the focus to different topics, and follower messages that do not introduce any new information but simply echo topics from the messages that they repost or reply. Our model captures the different extents that leader and follower messages may contain the key topical words, thus further enhances the quality of the induced topics. The results of thorough experiments demonstrate the effectiveness of our proposed model.",
author = "Jing Li and Ming Liao and Wei Gao and Yulan He and Wong, {Kam Fai}",
year = "2016",
language = "English",
volume = "4",
pages = "2114--2123",
booktitle = "54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers",
publisher = "Association for Computational Linguistics (ACL)",

}

TY - GEN

T1 - Topic extraction from microblog posts using conversation structures

AU - Li, Jing

AU - Liao, Ming

AU - Gao, Wei

AU - He, Yulan

AU - Wong, Kam Fai

PY - 2016

Y1 - 2016

N2 - Conventional topic models are ineffective for topic extraction from microblog messages since the lack of structure and context among the posts renders poor message-level word co-occurrence patterns. In this work, we organize microblog posts as conversation trees based on reposting and replying relations, which enrich context information to alleviate data sparseness. Our model generates words according to topic dependencies derived from the conversation structures. In specific, we differentiate messages as leader messages, which initiate key aspects of previously focused topics or shift the focus to different topics, and follower messages that do not introduce any new information but simply echo topics from the messages that they repost or reply. Our model captures the different extents that leader and follower messages may contain the key topical words, thus further enhances the quality of the induced topics. The results of thorough experiments demonstrate the effectiveness of our proposed model.

AB - Conventional topic models are ineffective for topic extraction from microblog messages since the lack of structure and context among the posts renders poor message-level word co-occurrence patterns. In this work, we organize microblog posts as conversation trees based on reposting and replying relations, which enrich context information to alleviate data sparseness. Our model generates words according to topic dependencies derived from the conversation structures. In specific, we differentiate messages as leader messages, which initiate key aspects of previously focused topics or shift the focus to different topics, and follower messages that do not introduce any new information but simply echo topics from the messages that they repost or reply. Our model captures the different extents that leader and follower messages may contain the key topical words, thus further enhances the quality of the induced topics. The results of thorough experiments demonstrate the effectiveness of our proposed model.

UR - http://www.scopus.com/inward/record.url?scp=85011977266&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85011977266&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85011977266

VL - 4

SP - 2114

EP - 2123

BT - 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers

PB - Association for Computational Linguistics (ACL)

ER -