Phoneme-based transliteration of foreign names for OOV problem

Wei Gao, Kam Fai Wong, Wai Lam

Research output: Chapter in Book/Report/Conference proceedingConference contribution

14 Citations (Scopus)

Abstract

A proper noun dictionary is never complete rendering name translation from English to Chinese ineffective. One way to solve this problem is not to rely on a dictionary alone but to adopt automatic translation according to pronunciation similarities, i.e. to map phonemes comprising an English name to the phonetic representations of the corresponding Chinese name. This process is called transliteration. We present a statistical transliteration method. An efficient algorithm for aligning phoneme chunks is described. Unlike rule-based approaches, our method is data-driven. Compared to source-channel based statistical approaches, we adopt a direct transliteration model, i.e. the direction of probabilistic estimation conforms to the transliteration direction. We demonstrate comparable performance to source-channel based system.

Original languageEnglish
Title of host publicationLecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
EditorsK.-Y. Su, J. Tsujii, J.-H. Lee, O.Y. Kwong
Pages110-119
Number of pages10
Volume3248
Publication statusPublished - 2005
Externally publishedYes
EventFirst International Joint Conference on Natural Language Processing - IJCNLP 2004 - Hainan Island, China
Duration: 22 Mar 200424 Mar 2004

Other

OtherFirst International Joint Conference on Natural Language Processing - IJCNLP 2004
CountryChina
CityHainan Island
Period22/3/0424/3/04

Fingerprint

Glossaries
Speech analysis
Statistical methods

ASJC Scopus subject areas

  • Hardware and Architecture

Cite this

Gao, W., Wong, K. F., & Lam, W. (2005). Phoneme-based transliteration of foreign names for OOV problem. In K-Y. Su, J. Tsujii, J-H. Lee, & O. Y. Kwong (Eds.), Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 3248, pp. 110-119)

Phoneme-based transliteration of foreign names for OOV problem. / Gao, Wei; Wong, Kam Fai; Lam, Wai.

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science). ed. / K.-Y. Su; J. Tsujii; J.-H. Lee; O.Y. Kwong. Vol. 3248 2005. p. 110-119.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Gao, W, Wong, KF & Lam, W 2005, Phoneme-based transliteration of foreign names for OOV problem. in K-Y Su, J Tsujii, J-H Lee & OY Kwong (eds), Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science). vol. 3248, pp. 110-119, First International Joint Conference on Natural Language Processing - IJCNLP 2004, Hainan Island, China, 22/3/04.
Gao W, Wong KF, Lam W. Phoneme-based transliteration of foreign names for OOV problem. In Su K-Y, Tsujii J, Lee J-H, Kwong OY, editors, Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science). Vol. 3248. 2005. p. 110-119
Gao, Wei ; Wong, Kam Fai ; Lam, Wai. / Phoneme-based transliteration of foreign names for OOV problem. Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science). editor / K.-Y. Su ; J. Tsujii ; J.-H. Lee ; O.Y. Kwong. Vol. 3248 2005. pp. 110-119
@inproceedings{b8d2b53fde694823bb3f3359d1e6c8f4,
title = "Phoneme-based transliteration of foreign names for OOV problem",
abstract = "A proper noun dictionary is never complete rendering name translation from English to Chinese ineffective. One way to solve this problem is not to rely on a dictionary alone but to adopt automatic translation according to pronunciation similarities, i.e. to map phonemes comprising an English name to the phonetic representations of the corresponding Chinese name. This process is called transliteration. We present a statistical transliteration method. An efficient algorithm for aligning phoneme chunks is described. Unlike rule-based approaches, our method is data-driven. Compared to source-channel based statistical approaches, we adopt a direct transliteration model, i.e. the direction of probabilistic estimation conforms to the transliteration direction. We demonstrate comparable performance to source-channel based system.",
author = "Wei Gao and Wong, {Kam Fai} and Wai Lam",
year = "2005",
language = "English",
volume = "3248",
pages = "110--119",
editor = "K.-Y. Su and J. Tsujii and J.-H. Lee and O.Y. Kwong",
booktitle = "Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)",

}

TY - GEN

T1 - Phoneme-based transliteration of foreign names for OOV problem

AU - Gao, Wei

AU - Wong, Kam Fai

AU - Lam, Wai

PY - 2005

Y1 - 2005

N2 - A proper noun dictionary is never complete rendering name translation from English to Chinese ineffective. One way to solve this problem is not to rely on a dictionary alone but to adopt automatic translation according to pronunciation similarities, i.e. to map phonemes comprising an English name to the phonetic representations of the corresponding Chinese name. This process is called transliteration. We present a statistical transliteration method. An efficient algorithm for aligning phoneme chunks is described. Unlike rule-based approaches, our method is data-driven. Compared to source-channel based statistical approaches, we adopt a direct transliteration model, i.e. the direction of probabilistic estimation conforms to the transliteration direction. We demonstrate comparable performance to source-channel based system.

AB - A proper noun dictionary is never complete rendering name translation from English to Chinese ineffective. One way to solve this problem is not to rely on a dictionary alone but to adopt automatic translation according to pronunciation similarities, i.e. to map phonemes comprising an English name to the phonetic representations of the corresponding Chinese name. This process is called transliteration. We present a statistical transliteration method. An efficient algorithm for aligning phoneme chunks is described. Unlike rule-based approaches, our method is data-driven. Compared to source-channel based statistical approaches, we adopt a direct transliteration model, i.e. the direction of probabilistic estimation conforms to the transliteration direction. We demonstrate comparable performance to source-channel based system.

UR - http://www.scopus.com/inward/record.url?scp=26444596109&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=26444596109&partnerID=8YFLogxK

M3 - Conference contribution

VL - 3248

SP - 110

EP - 119

BT - Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)

A2 - Su, K.-Y.

A2 - Tsujii, J.

A2 - Lee, J.-H.

A2 - Kwong, O.Y.

ER -