Guarani: A case study in resour

Ahmed Abdelali, James Cowie, Steve Helmreich, Wanying Jin, Maria Pilar Milagros, Bill Ogden, Hamid Mansouri Rad, Ron Zacharski

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

In this paper we describe a set of processes for the acquisition of resources for quick rampup machine translation (MT) from any language lacking significant machine tractable resources into English, using the Paraguayan indigenous language Guarani as well as Amharic and Chechen, as examples. Our task is to develop a 250,000 monolingual corpus, a 250,000 bilingual parallel corpus, and smaller corpora tagged with part of speech, named entity, and morphological annotations.

Original languageEnglish
Title of host publicationAMTA 2006 - Proceedings of the 7th Conference of the Association for Machine Translation of the Americas: Visions for the Future of Machine Translation
Pages1-9
Number of pages9
Publication statusPublished - 1 Dec 2006
Externally publishedYes
Event7th Biennial Conference of the Association for Machine Translation in the Americas, AMTA 2006 - Cambridge, MA, United States
Duration: 8 Aug 200612 Aug 2006

Other

Other7th Biennial Conference of the Association for Machine Translation in the Americas, AMTA 2006
CountryUnited States
CityCambridge, MA
Period8/8/0612/8/06

Fingerprint

Resources
Indigenous Languages
Part of Speech
Language
Amharic
Entity
Parallel Corpora
Annotation
Machine Translation

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Software

Cite this

Abdelali, A., Cowie, J., Helmreich, S., Jin, W., Milagros, M. P., Ogden, B., ... Zacharski, R. (2006). Guarani: A case study in resour. In AMTA 2006 - Proceedings of the 7th Conference of the Association for Machine Translation of the Americas: Visions for the Future of Machine Translation (pp. 1-9)

Guarani : A case study in resour. / Abdelali, Ahmed; Cowie, James; Helmreich, Steve; Jin, Wanying; Milagros, Maria Pilar; Ogden, Bill; Rad, Hamid Mansouri; Zacharski, Ron.

AMTA 2006 - Proceedings of the 7th Conference of the Association for Machine Translation of the Americas: Visions for the Future of Machine Translation. 2006. p. 1-9.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abdelali, A, Cowie, J, Helmreich, S, Jin, W, Milagros, MP, Ogden, B, Rad, HM & Zacharski, R 2006, Guarani: A case study in resour. in AMTA 2006 - Proceedings of the 7th Conference of the Association for Machine Translation of the Americas: Visions for the Future of Machine Translation. pp. 1-9, 7th Biennial Conference of the Association for Machine Translation in the Americas, AMTA 2006, Cambridge, MA, United States, 8/8/06.
Abdelali A, Cowie J, Helmreich S, Jin W, Milagros MP, Ogden B et al. Guarani: A case study in resour. In AMTA 2006 - Proceedings of the 7th Conference of the Association for Machine Translation of the Americas: Visions for the Future of Machine Translation. 2006. p. 1-9
Abdelali, Ahmed ; Cowie, James ; Helmreich, Steve ; Jin, Wanying ; Milagros, Maria Pilar ; Ogden, Bill ; Rad, Hamid Mansouri ; Zacharski, Ron. / Guarani : A case study in resour. AMTA 2006 - Proceedings of the 7th Conference of the Association for Machine Translation of the Americas: Visions for the Future of Machine Translation. 2006. pp. 1-9
@inproceedings{33e3af7888554779b54600a6c138c690,
title = "Guarani: A case study in resour",
abstract = "In this paper we describe a set of processes for the acquisition of resources for quick rampup machine translation (MT) from any language lacking significant machine tractable resources into English, using the Paraguayan indigenous language Guarani as well as Amharic and Chechen, as examples. Our task is to develop a 250,000 monolingual corpus, a 250,000 bilingual parallel corpus, and smaller corpora tagged with part of speech, named entity, and morphological annotations.",
author = "Ahmed Abdelali and James Cowie and Steve Helmreich and Wanying Jin and Milagros, {Maria Pilar} and Bill Ogden and Rad, {Hamid Mansouri} and Ron Zacharski",
year = "2006",
month = "12",
day = "1",
language = "English",
pages = "1--9",
booktitle = "AMTA 2006 - Proceedings of the 7th Conference of the Association for Machine Translation of the Americas: Visions for the Future of Machine Translation",

}

TY - GEN

T1 - Guarani

T2 - A case study in resour

AU - Abdelali, Ahmed

AU - Cowie, James

AU - Helmreich, Steve

AU - Jin, Wanying

AU - Milagros, Maria Pilar

AU - Ogden, Bill

AU - Rad, Hamid Mansouri

AU - Zacharski, Ron

PY - 2006/12/1

Y1 - 2006/12/1

N2 - In this paper we describe a set of processes for the acquisition of resources for quick rampup machine translation (MT) from any language lacking significant machine tractable resources into English, using the Paraguayan indigenous language Guarani as well as Amharic and Chechen, as examples. Our task is to develop a 250,000 monolingual corpus, a 250,000 bilingual parallel corpus, and smaller corpora tagged with part of speech, named entity, and morphological annotations.

AB - In this paper we describe a set of processes for the acquisition of resources for quick rampup machine translation (MT) from any language lacking significant machine tractable resources into English, using the Paraguayan indigenous language Guarani as well as Amharic and Chechen, as examples. Our task is to develop a 250,000 monolingual corpus, a 250,000 bilingual parallel corpus, and smaller corpora tagged with part of speech, named entity, and morphological annotations.

UR - http://www.scopus.com/inward/record.url?scp=84857577453&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84857577453&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84857577453

SP - 1

EP - 9

BT - AMTA 2006 - Proceedings of the 7th Conference of the Association for Machine Translation of the Americas: Visions for the Future of Machine Translation

ER -