Enabling medical translation for low-resource languages

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present research towards bridging the language gap between migrant workers in Qatar and medical staff. In particular, we present the first steps towards the development of a real-world Hindi-English machine translation system for doctor-patient communication. As this is a low-resource language pair, especially for speech and for the medical domain, our initial focus has been on gathering suitable training data from various sources. We applied a variety of methods ranging from fully automatic extraction from the Web to manual annotation of test data. Moreover, we developed a method for automatically augmenting the training data with synthetically generated variants, which yielded a very sizable improvement of more than 3 BLEU points absolute.

Original languageEnglish
Title of host publicationComputational Linguistics and Intelligent Text Processing - 17th International Conference, CICLing 2016, Revised Selected Papers
PublisherSpringer Verlag
Pages3-16
Number of pages14
ISBN (Print)9783319754864
DOIs
Publication statusPublished - 1 Jan 2018
Event17th International Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2016 - Konya, Turkey
Duration: 3 Apr 20169 Apr 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9624 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other17th International Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2016
CountryTurkey
CityKonya
Period3/4/169/4/16

    Fingerprint

Keywords

  • Doctor-patient communication
  • Hindi
  • Machine translation
  • Medical translation
  • Resource-poor languages

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Musleh, A., Durrani, N., Temnikova, I., Nakov, P., Vogel, S., & Alsaad, O. (2018). Enabling medical translation for low-resource languages. In Computational Linguistics and Intelligent Text Processing - 17th International Conference, CICLing 2016, Revised Selected Papers (pp. 3-16). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9624 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-75487-1_1