Phoneme-based transliteration of foreign names for OOV problem

Wei Gao, Kam Fai Wong, Wai Lam

Research output: Contribution to journalConference article

14 Citations (Scopus)

Abstract

A proper noun dictionary is never complete rendering name translation from English to Chinese ineffective. One way to solve this problem is not to rely on a dictionary alone but to adopt automatic translation according to pronunciation similarities, i.e. to map phonemes comprising an English name to the phonetic representations of the corresponding Chinese name. This process is called transliteration. We present a statistical transliteration method. An efficient algorithm for aligning phoneme chunks is described. Unlike rule-based approaches, our method is data-driven. Compared to source-channel based statistical approaches, we adopt a direct transliteration model, i.e. the direction of probabilistic estimation conforms to the transliteration direction. We demonstrate comparable performance to source-channel based system.

Original languageEnglish
Pages (from-to)110-119
Number of pages10
JournalLecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
Volume3248
DOIs
Publication statusPublished - 1 Jan 2005
EventFirst International Joint Conference on Natural Language Processing - IJCNLP 2004 - Hainan Island, China
Duration: 22 Mar 200424 Mar 2004

    Fingerprint

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this