Mining translations of OOV terms from the web through cross-lingual query expansion

Ying Zhang, Fei Huang, Stephan Vogel

Research output: Chapter in Book/Report/Conference proceedingConference contribution

33 Citations (Scopus)

Abstract

Translating out-of-vocabulary (OOV) terms is a great challenge for the Cross-lingual Information Retrieval and Data-driven Machine Translation systems. Several approaches have been proposed to mine translations for OOV terms from the web, especially from pages containing mixed languages. In this paper, we propose a novel approach to automatically translate OOV terms on the fly through cross-lingual query expansion. The proposed approach does not require any web crawling and has achieved an inclusion rate of 95% and overall translation accuracy of 90%, outperforming state-of-the-art OOV translation techniques.

Original languageEnglish
Title of host publicationSIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Pages669-670
Number of pages2
DOIs
Publication statusPublished - 1 Dec 2005
Event28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2005 - Salvador, Brazil
Duration: 15 Aug 200519 Aug 2005

Publication series

NameSIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

Other

Other28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2005
CountryBrazil
CitySalvador
Period15/8/0519/8/05

    Fingerprint

Keywords

  • OOV terms
  • automatic translation
  • cross-lingual IR
  • query expansion

ASJC Scopus subject areas

  • Information Systems

Cite this

Zhang, Y., Huang, F., & Vogel, S. (2005). Mining translations of OOV terms from the web through cross-lingual query expansion. In SIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 669-670). (SIGIR 2005 - Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval). https://doi.org/10.1145/1076034.1076182