Dealing with spurious ambiguity in learning ITG-based word alignment

Shujian Huang, Stephan Vogel, Jiajun Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Word alignment has an exponentially large search space, which often makes exact inference infeasible. Recent studies have shown that inversion transduction grammars are reasonable constraints for word alignment, and that the constrained space could be efficiently searched using synchronous parsing algorithms. However, spurious ambiguity may occur in synchronous parsing and cause problems in both search efficiency and accuracy. In this paper, we conduct a detailed study of the causes of spurious ambiguity and how it effects parsing and discriminative learning. We also propose a variant of the grammar which eliminates those ambiguities. Our grammar shows advantages over previous grammars in both synthetic and real-world experiments.

Original languageEnglish
Title of host publicationACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
Pages379-383
Number of pages5
Volume2
Publication statusPublished - 1 Dec 2011
Externally publishedYes
Event49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-HLT 2011 - Portland, OR, United States
Duration: 19 Jun 201124 Jun 2011

Other

Other49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-HLT 2011
CountryUnited States
CityPortland, OR
Period19/6/1124/6/11

Fingerprint

grammar
learning
cause
efficiency
Alignment
Grammar
experiment
Parsing
Causes

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Cite this

Huang, S., Vogel, S., & Chen, J. (2011). Dealing with spurious ambiguity in learning ITG-based word alignment. In ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (Vol. 2, pp. 379-383)

Dealing with spurious ambiguity in learning ITG-based word alignment. / Huang, Shujian; Vogel, Stephan; Chen, Jiajun.

ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Vol. 2 2011. p. 379-383.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Huang, S, Vogel, S & Chen, J 2011, Dealing with spurious ambiguity in learning ITG-based word alignment. in ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. vol. 2, pp. 379-383, 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-HLT 2011, Portland, OR, United States, 19/6/11.
Huang S, Vogel S, Chen J. Dealing with spurious ambiguity in learning ITG-based word alignment. In ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Vol. 2. 2011. p. 379-383
Huang, Shujian ; Vogel, Stephan ; Chen, Jiajun. / Dealing with spurious ambiguity in learning ITG-based word alignment. ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Vol. 2 2011. pp. 379-383
@inproceedings{16bbbd70d72847019c72c7b8be2cc493,
title = "Dealing with spurious ambiguity in learning ITG-based word alignment",
abstract = "Word alignment has an exponentially large search space, which often makes exact inference infeasible. Recent studies have shown that inversion transduction grammars are reasonable constraints for word alignment, and that the constrained space could be efficiently searched using synchronous parsing algorithms. However, spurious ambiguity may occur in synchronous parsing and cause problems in both search efficiency and accuracy. In this paper, we conduct a detailed study of the causes of spurious ambiguity and how it effects parsing and discriminative learning. We also propose a variant of the grammar which eliminates those ambiguities. Our grammar shows advantages over previous grammars in both synthetic and real-world experiments.",
author = "Shujian Huang and Stephan Vogel and Jiajun Chen",
year = "2011",
month = "12",
day = "1",
language = "English",
isbn = "9781932432886",
volume = "2",
pages = "379--383",
booktitle = "ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies",

}

TY - GEN

T1 - Dealing with spurious ambiguity in learning ITG-based word alignment

AU - Huang, Shujian

AU - Vogel, Stephan

AU - Chen, Jiajun

PY - 2011/12/1

Y1 - 2011/12/1

N2 - Word alignment has an exponentially large search space, which often makes exact inference infeasible. Recent studies have shown that inversion transduction grammars are reasonable constraints for word alignment, and that the constrained space could be efficiently searched using synchronous parsing algorithms. However, spurious ambiguity may occur in synchronous parsing and cause problems in both search efficiency and accuracy. In this paper, we conduct a detailed study of the causes of spurious ambiguity and how it effects parsing and discriminative learning. We also propose a variant of the grammar which eliminates those ambiguities. Our grammar shows advantages over previous grammars in both synthetic and real-world experiments.

AB - Word alignment has an exponentially large search space, which often makes exact inference infeasible. Recent studies have shown that inversion transduction grammars are reasonable constraints for word alignment, and that the constrained space could be efficiently searched using synchronous parsing algorithms. However, spurious ambiguity may occur in synchronous parsing and cause problems in both search efficiency and accuracy. In this paper, we conduct a detailed study of the causes of spurious ambiguity and how it effects parsing and discriminative learning. We also propose a variant of the grammar which eliminates those ambiguities. Our grammar shows advantages over previous grammars in both synthetic and real-world experiments.

UR - http://www.scopus.com/inward/record.url?scp=84859045764&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84859045764&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84859045764

SN - 9781932432886

VL - 2

SP - 379

EP - 383

BT - ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies

ER -