The UPC TweetMT participation: Translating formal tweets using context information

Eva Martnez Garcia, Cristina España-Bonet, Lluis Marques

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we describe the UPC systems that participated in the TweetMT shared task. We developed two main systems that were applied to the Spanish{Catalan language pair: a state-of-the-art phrase-based statistical machine translation system and a context-aware system. In the second approach, we define the \context" for a tweet as the tweets of a user produced in the same day, and also, we study the impact of this kind of information in the final translations when using a document-level decoder. A variant of this approach considers also semantic information from bilingual embeddings.

Original languageEnglish
Title of host publicationCEUR Workshop Proceedings
PublisherCEUR-WS
Pages25-32
Number of pages8
Volume1445
Publication statusPublished - 2015
EventTweet Translation Workshop 2015, TweetMT 2015 - co-located with 31st Conference of the Spanish Society for Natural Language Processing, SEPLN 2015 - Alicante, Spain
Duration: 15 Sep 2015 → …

Other

OtherTweet Translation Workshop 2015, TweetMT 2015 - co-located with 31st Conference of the Spanish Society for Natural Language Processing, SEPLN 2015
CountrySpain
CityAlicante
Period15/9/15 → …

    Fingerprint

Keywords

  • Context Aware Translation
  • Machine Translation
  • Twitter

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Garcia, E. M., España-Bonet, C., & Marques, L. (2015). The UPC TweetMT participation: Translating formal tweets using context information. In CEUR Workshop Proceedings (Vol. 1445, pp. 25-32). CEUR-WS.