Microblog search and filtering with real-time dynamics based on BM25

Wei Gao, Zhongyu Wei, Kam Fai Wong

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Microblogs such as Twitter are considered faster first-hand sources of information with many real-time fashions. We report our work in the real-time ad hoc search and filtering tasks of TREC 2012 microblog track. Our system is built based on the traditional BM25 relevance model, in which specific techniques are tried out to respond to the need of finding relevant tweets. In the real-time ad hoc task, we applied a peak detection algorithm for the process of blind feedback. We also tried to automatically combine the search results of multiple retrieval techniques. In the real-time filtering pilot task, we examine the effectiveness of some typical filtering methods previously used in TREC filtering track.

Original languageEnglish
Title of host publicationSocial Media Content Analysis
Subtitle of host publicationNatural Language Processing and Beyond
PublisherWorld Scientific Publishing Co. Pte Ltd
Pages19-30
Number of pages12
ISBN (Electronic)9789813223615
ISBN (Print)9789813223608
DOIs
Publication statusPublished - 1 Jan 2017

Fingerprint

Feedback

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Gao, W., Wei, Z., & Wong, K. F. (2017). Microblog search and filtering with real-time dynamics based on BM25. In Social Media Content Analysis: Natural Language Processing and Beyond (pp. 19-30). World Scientific Publishing Co. Pte Ltd. https://doi.org/10.1142/9789813223615_0002

Microblog search and filtering with real-time dynamics based on BM25. / Gao, Wei; Wei, Zhongyu; Wong, Kam Fai.

Social Media Content Analysis: Natural Language Processing and Beyond. World Scientific Publishing Co. Pte Ltd, 2017. p. 19-30.

Research output: Chapter in Book/Report/Conference proceedingChapter

Gao, W, Wei, Z & Wong, KF 2017, Microblog search and filtering with real-time dynamics based on BM25. in Social Media Content Analysis: Natural Language Processing and Beyond. World Scientific Publishing Co. Pte Ltd, pp. 19-30. https://doi.org/10.1142/9789813223615_0002
Gao W, Wei Z, Wong KF. Microblog search and filtering with real-time dynamics based on BM25. In Social Media Content Analysis: Natural Language Processing and Beyond. World Scientific Publishing Co. Pte Ltd. 2017. p. 19-30 https://doi.org/10.1142/9789813223615_0002
Gao, Wei ; Wei, Zhongyu ; Wong, Kam Fai. / Microblog search and filtering with real-time dynamics based on BM25. Social Media Content Analysis: Natural Language Processing and Beyond. World Scientific Publishing Co. Pte Ltd, 2017. pp. 19-30
@inbook{03ef69200007422d8e2a7303fb97aeca,
title = "Microblog search and filtering with real-time dynamics based on BM25",
abstract = "Microblogs such as Twitter are considered faster first-hand sources of information with many real-time fashions. We report our work in the real-time ad hoc search and filtering tasks of TREC 2012 microblog track. Our system is built based on the traditional BM25 relevance model, in which specific techniques are tried out to respond to the need of finding relevant tweets. In the real-time ad hoc task, we applied a peak detection algorithm for the process of blind feedback. We also tried to automatically combine the search results of multiple retrieval techniques. In the real-time filtering pilot task, we examine the effectiveness of some typical filtering methods previously used in TREC filtering track.",
author = "Wei Gao and Zhongyu Wei and Wong, {Kam Fai}",
year = "2017",
month = "1",
day = "1",
doi = "10.1142/9789813223615_0002",
language = "English",
isbn = "9789813223608",
pages = "19--30",
booktitle = "Social Media Content Analysis",
publisher = "World Scientific Publishing Co. Pte Ltd",
address = "Singapore",

}

TY - CHAP

T1 - Microblog search and filtering with real-time dynamics based on BM25

AU - Gao, Wei

AU - Wei, Zhongyu

AU - Wong, Kam Fai

PY - 2017/1/1

Y1 - 2017/1/1

N2 - Microblogs such as Twitter are considered faster first-hand sources of information with many real-time fashions. We report our work in the real-time ad hoc search and filtering tasks of TREC 2012 microblog track. Our system is built based on the traditional BM25 relevance model, in which specific techniques are tried out to respond to the need of finding relevant tweets. In the real-time ad hoc task, we applied a peak detection algorithm for the process of blind feedback. We also tried to automatically combine the search results of multiple retrieval techniques. In the real-time filtering pilot task, we examine the effectiveness of some typical filtering methods previously used in TREC filtering track.

AB - Microblogs such as Twitter are considered faster first-hand sources of information with many real-time fashions. We report our work in the real-time ad hoc search and filtering tasks of TREC 2012 microblog track. Our system is built based on the traditional BM25 relevance model, in which specific techniques are tried out to respond to the need of finding relevant tweets. In the real-time ad hoc task, we applied a peak detection algorithm for the process of blind feedback. We also tried to automatically combine the search results of multiple retrieval techniques. In the real-time filtering pilot task, we examine the effectiveness of some typical filtering methods previously used in TREC filtering track.

UR - http://www.scopus.com/inward/record.url?scp=85041634611&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85041634611&partnerID=8YFLogxK

U2 - 10.1142/9789813223615_0002

DO - 10.1142/9789813223615_0002

M3 - Chapter

SN - 9789813223608

SP - 19

EP - 30

BT - Social Media Content Analysis

PB - World Scientific Publishing Co. Pte Ltd

ER -