Statistical denormalization for arabic text

Mohammed Moussa, Mohamed Waleed Fakhr, Kareem Darwish

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

In this paper, we focus on a sub-problem of Arabic text error correction, namely Arabic Text Denormalization. Text Denormalization is considered an important post-processing step when performing machine translation into Arabic. We examine different approaches for denormalization via the use of language modeling, stemming, and sequence labeling. We show the effectiveness of different approaches and how they can be combined to attain better results. We perform intrinsic evaluation as well as extrinsic evaluation in the context of machine translation.

Original languageEnglish
Title of host publication11th Conference on Natural Language Processing, KONVENS 2012: Empirical Methods in Natural Language Processing - Proceedings of the Conference on Natural Language Processing 2012
Pages228-232
Number of pages5
Volume5
Publication statusPublished - 1 Dec 2012
Event11th Conference on Natural Language Processing 2012: Empirical Methods in Natural Language Processing, KONVENS 2012 - Vienna, Austria
Duration: 19 Sep 201221 Sep 2012

Other

Other11th Conference on Natural Language Processing 2012: Empirical Methods in Natural Language Processing, KONVENS 2012
CountryAustria
CityVienna
Period19/9/1221/9/12

Fingerprint

Error correction
Labeling
Processing

ASJC Scopus subject areas

  • Software

Cite this

Moussa, M., Fakhr, M. W., & Darwish, K. (2012). Statistical denormalization for arabic text. In 11th Conference on Natural Language Processing, KONVENS 2012: Empirical Methods in Natural Language Processing - Proceedings of the Conference on Natural Language Processing 2012 (Vol. 5, pp. 228-232)

Statistical denormalization for arabic text. / Moussa, Mohammed; Fakhr, Mohamed Waleed; Darwish, Kareem.

11th Conference on Natural Language Processing, KONVENS 2012: Empirical Methods in Natural Language Processing - Proceedings of the Conference on Natural Language Processing 2012. Vol. 5 2012. p. 228-232.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Moussa, M, Fakhr, MW & Darwish, K 2012, Statistical denormalization for arabic text. in 11th Conference on Natural Language Processing, KONVENS 2012: Empirical Methods in Natural Language Processing - Proceedings of the Conference on Natural Language Processing 2012. vol. 5, pp. 228-232, 11th Conference on Natural Language Processing 2012: Empirical Methods in Natural Language Processing, KONVENS 2012, Vienna, Austria, 19/9/12.
Moussa M, Fakhr MW, Darwish K. Statistical denormalization for arabic text. In 11th Conference on Natural Language Processing, KONVENS 2012: Empirical Methods in Natural Language Processing - Proceedings of the Conference on Natural Language Processing 2012. Vol. 5. 2012. p. 228-232
Moussa, Mohammed ; Fakhr, Mohamed Waleed ; Darwish, Kareem. / Statistical denormalization for arabic text. 11th Conference on Natural Language Processing, KONVENS 2012: Empirical Methods in Natural Language Processing - Proceedings of the Conference on Natural Language Processing 2012. Vol. 5 2012. pp. 228-232
@inproceedings{5f92517b0f3d4dcb9ad6e5de65c474d1,
title = "Statistical denormalization for arabic text",
abstract = "In this paper, we focus on a sub-problem of Arabic text error correction, namely Arabic Text Denormalization. Text Denormalization is considered an important post-processing step when performing machine translation into Arabic. We examine different approaches for denormalization via the use of language modeling, stemming, and sequence labeling. We show the effectiveness of different approaches and how they can be combined to attain better results. We perform intrinsic evaluation as well as extrinsic evaluation in the context of machine translation.",
author = "Mohammed Moussa and Fakhr, {Mohamed Waleed} and Kareem Darwish",
year = "2012",
month = "12",
day = "1",
language = "English",
isbn = "385027005X",
volume = "5",
pages = "228--232",
booktitle = "11th Conference on Natural Language Processing, KONVENS 2012: Empirical Methods in Natural Language Processing - Proceedings of the Conference on Natural Language Processing 2012",

}

TY - GEN

T1 - Statistical denormalization for arabic text

AU - Moussa, Mohammed

AU - Fakhr, Mohamed Waleed

AU - Darwish, Kareem

PY - 2012/12/1

Y1 - 2012/12/1

N2 - In this paper, we focus on a sub-problem of Arabic text error correction, namely Arabic Text Denormalization. Text Denormalization is considered an important post-processing step when performing machine translation into Arabic. We examine different approaches for denormalization via the use of language modeling, stemming, and sequence labeling. We show the effectiveness of different approaches and how they can be combined to attain better results. We perform intrinsic evaluation as well as extrinsic evaluation in the context of machine translation.

AB - In this paper, we focus on a sub-problem of Arabic text error correction, namely Arabic Text Denormalization. Text Denormalization is considered an important post-processing step when performing machine translation into Arabic. We examine different approaches for denormalization via the use of language modeling, stemming, and sequence labeling. We show the effectiveness of different approaches and how they can be combined to attain better results. We perform intrinsic evaluation as well as extrinsic evaluation in the context of machine translation.

UR - http://www.scopus.com/inward/record.url?scp=84893312303&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84893312303&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84893312303

SN - 385027005X

SN - 9783850270052

VL - 5

SP - 228

EP - 232

BT - 11th Conference on Natural Language Processing, KONVENS 2012: Empirical Methods in Natural Language Processing - Proceedings of the Conference on Natural Language Processing 2012

ER -