Mining translations of OOV terms from the web through cross-lingual query expansion

Ying Zhang, Fei Huang, Stephan Vogel

Research output: Chapter in Book/Report/Conference proceedingConference contribution

32 Citations (Scopus)

Abstract

Translating out-of-vocabulary (OOV) terms is a great challenge for the Cross-lingual Information Retrieval and Data-driven Machine Translation systems. Several approaches have been proposed to mine translations for OOV terms from the web, especially from pages containing mixed languages. In this paper, we propose a novel approach to automatically translate OOV terms on the fly through cross-lingual query expansion. The proposed approach does not require any web crawling and has achieved an inclusion rate of 95% and overall translation accuracy of 90%, outperforming state-of-the-art OOV translation techniques.

Original languageEnglish
Title of host publicationSIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Pages669-670
Number of pages2
DOIs
Publication statusPublished - 1 Dec 2005
Externally publishedYes
Event28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2005 - Salvador, Brazil
Duration: 15 Aug 200519 Aug 2005

Other

Other28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2005
CountryBrazil
CitySalvador
Period15/8/0519/8/05

Fingerprint

Information retrieval

Keywords

  • automatic translation
  • cross-lingual IR
  • OOV terms
  • query expansion

ASJC Scopus subject areas

  • Information Systems

Cite this

Zhang, Y., Huang, F., & Vogel, S. (2005). Mining translations of OOV terms from the web through cross-lingual query expansion. In SIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 669-670) https://doi.org/10.1145/1076034.1076182

Mining translations of OOV terms from the web through cross-lingual query expansion. / Zhang, Ying; Huang, Fei; Vogel, Stephan.

SIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2005. p. 669-670.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Zhang, Y, Huang, F & Vogel, S 2005, Mining translations of OOV terms from the web through cross-lingual query expansion. in SIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. pp. 669-670, 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2005, Salvador, Brazil, 15/8/05. https://doi.org/10.1145/1076034.1076182
Zhang Y, Huang F, Vogel S. Mining translations of OOV terms from the web through cross-lingual query expansion. In SIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2005. p. 669-670 https://doi.org/10.1145/1076034.1076182
Zhang, Ying ; Huang, Fei ; Vogel, Stephan. / Mining translations of OOV terms from the web through cross-lingual query expansion. SIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2005. pp. 669-670
@inproceedings{6508925eb43940d5a148b334c648186e,
title = "Mining translations of OOV terms from the web through cross-lingual query expansion",
abstract = "Translating out-of-vocabulary (OOV) terms is a great challenge for the Cross-lingual Information Retrieval and Data-driven Machine Translation systems. Several approaches have been proposed to mine translations for OOV terms from the web, especially from pages containing mixed languages. In this paper, we propose a novel approach to automatically translate OOV terms on the fly through cross-lingual query expansion. The proposed approach does not require any web crawling and has achieved an inclusion rate of 95{\%} and overall translation accuracy of 90{\%}, outperforming state-of-the-art OOV translation techniques.",
keywords = "automatic translation, cross-lingual IR, OOV terms, query expansion",
author = "Ying Zhang and Fei Huang and Stephan Vogel",
year = "2005",
month = "12",
day = "1",
doi = "10.1145/1076034.1076182",
language = "English",
isbn = "1595930345",
pages = "669--670",
booktitle = "SIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval",

}

TY - GEN

T1 - Mining translations of OOV terms from the web through cross-lingual query expansion

AU - Zhang, Ying

AU - Huang, Fei

AU - Vogel, Stephan

PY - 2005/12/1

Y1 - 2005/12/1

N2 - Translating out-of-vocabulary (OOV) terms is a great challenge for the Cross-lingual Information Retrieval and Data-driven Machine Translation systems. Several approaches have been proposed to mine translations for OOV terms from the web, especially from pages containing mixed languages. In this paper, we propose a novel approach to automatically translate OOV terms on the fly through cross-lingual query expansion. The proposed approach does not require any web crawling and has achieved an inclusion rate of 95% and overall translation accuracy of 90%, outperforming state-of-the-art OOV translation techniques.

AB - Translating out-of-vocabulary (OOV) terms is a great challenge for the Cross-lingual Information Retrieval and Data-driven Machine Translation systems. Several approaches have been proposed to mine translations for OOV terms from the web, especially from pages containing mixed languages. In this paper, we propose a novel approach to automatically translate OOV terms on the fly through cross-lingual query expansion. The proposed approach does not require any web crawling and has achieved an inclusion rate of 95% and overall translation accuracy of 90%, outperforming state-of-the-art OOV translation techniques.

KW - automatic translation

KW - cross-lingual IR

KW - OOV terms

KW - query expansion

UR - http://www.scopus.com/inward/record.url?scp=80155125909&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80155125909&partnerID=8YFLogxK

U2 - 10.1145/1076034.1076182

DO - 10.1145/1076034.1076182

M3 - Conference contribution

SN - 1595930345

SN - 9781595930347

SP - 669

EP - 670

BT - SIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

ER -