Language processing for arabic microblog retrieval

Kareem Darwish, Walid Magdy, Ahmed Mourad

Research output: Chapter in Book/Report/Conference proceedingConference contribution

31 Citations (Scopus)

Abstract

The use of social media has profoundly affected social and political dynamics in the Arab world. In this paper, we explore the Arabic microblogs retrieval. We illustrate some of the challenges associated with Arabic microblog retrieval, which mainly stem from the use of different Arabic dialects that vary in lexical selection, morphology, and phonetics and lack orthographic and spelling conventions. We present some of the required processing for effective retrieval such as improved letter normalization, elongated word handling, stopword removal, and stemming.

Original languageEnglish
Title of host publicationACM International Conference Proceeding Series
Pages2427-2430
Number of pages4
DOIs
Publication statusPublished - 19 Dec 2012
Event21st ACM International Conference on Information and Knowledge Management, CIKM 2012 - Maui, HI, United States
Duration: 29 Oct 20122 Nov 2012

Other

Other21st ACM International Conference on Information and Knowledge Management, CIKM 2012
CountryUnited States
CityMaui, HI
Period29/10/122/11/12

Fingerprint

Speech analysis
Processing

Keywords

  • arabic retrieval
  • arabic twitter
  • dialect arabic normalization
  • microblog search

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Software

Cite this

Darwish, K., Magdy, W., & Mourad, A. (2012). Language processing for arabic microblog retrieval. In ACM International Conference Proceeding Series (pp. 2427-2430) https://doi.org/10.1145/2396761.2398658

Language processing for arabic microblog retrieval. / Darwish, Kareem; Magdy, Walid; Mourad, Ahmed.

ACM International Conference Proceeding Series. 2012. p. 2427-2430.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Darwish, K, Magdy, W & Mourad, A 2012, Language processing for arabic microblog retrieval. in ACM International Conference Proceeding Series. pp. 2427-2430, 21st ACM International Conference on Information and Knowledge Management, CIKM 2012, Maui, HI, United States, 29/10/12. https://doi.org/10.1145/2396761.2398658
Darwish K, Magdy W, Mourad A. Language processing for arabic microblog retrieval. In ACM International Conference Proceeding Series. 2012. p. 2427-2430 https://doi.org/10.1145/2396761.2398658
Darwish, Kareem ; Magdy, Walid ; Mourad, Ahmed. / Language processing for arabic microblog retrieval. ACM International Conference Proceeding Series. 2012. pp. 2427-2430
@inproceedings{7009547bd1e44cf9ab62084b34249620,
title = "Language processing for arabic microblog retrieval",
abstract = "The use of social media has profoundly affected social and political dynamics in the Arab world. In this paper, we explore the Arabic microblogs retrieval. We illustrate some of the challenges associated with Arabic microblog retrieval, which mainly stem from the use of different Arabic dialects that vary in lexical selection, morphology, and phonetics and lack orthographic and spelling conventions. We present some of the required processing for effective retrieval such as improved letter normalization, elongated word handling, stopword removal, and stemming.",
keywords = "arabic retrieval, arabic twitter, dialect arabic normalization, microblog search",
author = "Kareem Darwish and Walid Magdy and Ahmed Mourad",
year = "2012",
month = "12",
day = "19",
doi = "10.1145/2396761.2398658",
language = "English",
isbn = "9781450311564",
pages = "2427--2430",
booktitle = "ACM International Conference Proceeding Series",

}

TY - GEN

T1 - Language processing for arabic microblog retrieval

AU - Darwish, Kareem

AU - Magdy, Walid

AU - Mourad, Ahmed

PY - 2012/12/19

Y1 - 2012/12/19

N2 - The use of social media has profoundly affected social and political dynamics in the Arab world. In this paper, we explore the Arabic microblogs retrieval. We illustrate some of the challenges associated with Arabic microblog retrieval, which mainly stem from the use of different Arabic dialects that vary in lexical selection, morphology, and phonetics and lack orthographic and spelling conventions. We present some of the required processing for effective retrieval such as improved letter normalization, elongated word handling, stopword removal, and stemming.

AB - The use of social media has profoundly affected social and political dynamics in the Arab world. In this paper, we explore the Arabic microblogs retrieval. We illustrate some of the challenges associated with Arabic microblog retrieval, which mainly stem from the use of different Arabic dialects that vary in lexical selection, morphology, and phonetics and lack orthographic and spelling conventions. We present some of the required processing for effective retrieval such as improved letter normalization, elongated word handling, stopword removal, and stemming.

KW - arabic retrieval

KW - arabic twitter

KW - dialect arabic normalization

KW - microblog search

UR - http://www.scopus.com/inward/record.url?scp=84871048884&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84871048884&partnerID=8YFLogxK

U2 - 10.1145/2396761.2398658

DO - 10.1145/2396761.2398658

M3 - Conference contribution

AN - SCOPUS:84871048884

SN - 9781450311564

SP - 2427

EP - 2430

BT - ACM International Conference Proceeding Series

ER -