CrisisLex: A lexicon for collecting and filtering Microblogged communications in crises

Alexandra Olteanu, Carlos Castillo, Fernando Diaz, Sarah Vieweg

Research output: Chapter in Book/Report/Conference proceedingConference contribution

108 Citations (Scopus)

Abstract

Locating timely, useful information during crises and mass emergencies is critical for those forced to make potentially life-Altering decisions. As the use of Twitter to broadcast useful information during such situations becomes more widespread, the problem of finding it becomes more difficult. We describe an approach toward improving the recall in the sampling of Twitter communications that can lead to greater situational awareness during crisis situations. First, we create a lexicon of crisis-related terms that frequently appear in relevant messages posted during different types of crisis situations. Next, we demonstrate how we use the lexicon to automatically identify new terms that describe a given crisis. Finally, we explain how to efficiently query Twitter to extract crisis-related messages during emergency events. In our experiments, using a crisis lexicon leads to substantial improvements in terms of recall when added to a set of crisis-specific keywords manually chosen by experts; it also helps to preserve the original distribution of message types.

Original languageEnglish
Title of host publicationProceedings of the 8th International Conference on Weblogs and Social Media, ICWSM 2014
PublisherThe AAAI Press
Pages376-385
Number of pages10
ISBN (Print)9781577356578
Publication statusPublished - 1 Jan 2014
Event8th International Conference on Weblogs and Social Media, ICWSM 2014 - Ann Arbor, United States
Duration: 1 Jun 20144 Jun 2014

Other

Other8th International Conference on Weblogs and Social Media, ICWSM 2014
CountryUnited States
CityAnn Arbor
Period1/6/144/6/14

Fingerprint

Sampling
Communication
Experiments

ASJC Scopus subject areas

  • Computer Networks and Communications

Cite this

Olteanu, A., Castillo, C., Diaz, F., & Vieweg, S. (2014). CrisisLex: A lexicon for collecting and filtering Microblogged communications in crises. In Proceedings of the 8th International Conference on Weblogs and Social Media, ICWSM 2014 (pp. 376-385). The AAAI Press.

CrisisLex : A lexicon for collecting and filtering Microblogged communications in crises. / Olteanu, Alexandra; Castillo, Carlos; Diaz, Fernando; Vieweg, Sarah.

Proceedings of the 8th International Conference on Weblogs and Social Media, ICWSM 2014. The AAAI Press, 2014. p. 376-385.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Olteanu, A, Castillo, C, Diaz, F & Vieweg, S 2014, CrisisLex: A lexicon for collecting and filtering Microblogged communications in crises. in Proceedings of the 8th International Conference on Weblogs and Social Media, ICWSM 2014. The AAAI Press, pp. 376-385, 8th International Conference on Weblogs and Social Media, ICWSM 2014, Ann Arbor, United States, 1/6/14.
Olteanu A, Castillo C, Diaz F, Vieweg S. CrisisLex: A lexicon for collecting and filtering Microblogged communications in crises. In Proceedings of the 8th International Conference on Weblogs and Social Media, ICWSM 2014. The AAAI Press. 2014. p. 376-385
Olteanu, Alexandra ; Castillo, Carlos ; Diaz, Fernando ; Vieweg, Sarah. / CrisisLex : A lexicon for collecting and filtering Microblogged communications in crises. Proceedings of the 8th International Conference on Weblogs and Social Media, ICWSM 2014. The AAAI Press, 2014. pp. 376-385
@inproceedings{f2278be62447442493cb726e6a77630a,
title = "CrisisLex: A lexicon for collecting and filtering Microblogged communications in crises",
abstract = "Locating timely, useful information during crises and mass emergencies is critical for those forced to make potentially life-Altering decisions. As the use of Twitter to broadcast useful information during such situations becomes more widespread, the problem of finding it becomes more difficult. We describe an approach toward improving the recall in the sampling of Twitter communications that can lead to greater situational awareness during crisis situations. First, we create a lexicon of crisis-related terms that frequently appear in relevant messages posted during different types of crisis situations. Next, we demonstrate how we use the lexicon to automatically identify new terms that describe a given crisis. Finally, we explain how to efficiently query Twitter to extract crisis-related messages during emergency events. In our experiments, using a crisis lexicon leads to substantial improvements in terms of recall when added to a set of crisis-specific keywords manually chosen by experts; it also helps to preserve the original distribution of message types.",
author = "Alexandra Olteanu and Carlos Castillo and Fernando Diaz and Sarah Vieweg",
year = "2014",
month = "1",
day = "1",
language = "English",
isbn = "9781577356578",
pages = "376--385",
booktitle = "Proceedings of the 8th International Conference on Weblogs and Social Media, ICWSM 2014",
publisher = "The AAAI Press",

}

TY - GEN

T1 - CrisisLex

T2 - A lexicon for collecting and filtering Microblogged communications in crises

AU - Olteanu, Alexandra

AU - Castillo, Carlos

AU - Diaz, Fernando

AU - Vieweg, Sarah

PY - 2014/1/1

Y1 - 2014/1/1

N2 - Locating timely, useful information during crises and mass emergencies is critical for those forced to make potentially life-Altering decisions. As the use of Twitter to broadcast useful information during such situations becomes more widespread, the problem of finding it becomes more difficult. We describe an approach toward improving the recall in the sampling of Twitter communications that can lead to greater situational awareness during crisis situations. First, we create a lexicon of crisis-related terms that frequently appear in relevant messages posted during different types of crisis situations. Next, we demonstrate how we use the lexicon to automatically identify new terms that describe a given crisis. Finally, we explain how to efficiently query Twitter to extract crisis-related messages during emergency events. In our experiments, using a crisis lexicon leads to substantial improvements in terms of recall when added to a set of crisis-specific keywords manually chosen by experts; it also helps to preserve the original distribution of message types.

AB - Locating timely, useful information during crises and mass emergencies is critical for those forced to make potentially life-Altering decisions. As the use of Twitter to broadcast useful information during such situations becomes more widespread, the problem of finding it becomes more difficult. We describe an approach toward improving the recall in the sampling of Twitter communications that can lead to greater situational awareness during crisis situations. First, we create a lexicon of crisis-related terms that frequently appear in relevant messages posted during different types of crisis situations. Next, we demonstrate how we use the lexicon to automatically identify new terms that describe a given crisis. Finally, we explain how to efficiently query Twitter to extract crisis-related messages during emergency events. In our experiments, using a crisis lexicon leads to substantial improvements in terms of recall when added to a set of crisis-specific keywords manually chosen by experts; it also helps to preserve the original distribution of message types.

UR - http://www.scopus.com/inward/record.url?scp=84909957669&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84909957669&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84909957669

SN - 9781577356578

SP - 376

EP - 385

BT - Proceedings of the 8th International Conference on Weblogs and Social Media, ICWSM 2014

PB - The AAAI Press

ER -