Cross-Language frame semantics transfer in bilingual corpora

Roberto Basili, Diego De Cao, Danilo Croce, Bonaventura Coppola, Alessandro Moschitti

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)

20 Citations (Scopus)

Abstract

Work on transferring semantic information across languages has recently been applied to developing resources annotated with Frame information for several non-English European languages. These approaches assume that parallel corpora annotated for English can be used to transfer the semantic information to the target languages. This paper presents a robust method based on a statistical machine translation step augmented with simple rule-based post-processing. It alleviates problems related to preprocessing errors and the complex optimization required by syntax-dependent models of the cross-lingual mapping. Different alignment strategies are investigated on the Europarl corpus. Results suggest that the quality of the derived annotations is surprisingly good and well suited for training semantic role labeling systems.
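
The general idea of transferring annotations through word alignments can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy sentence pair, the hand-written alignment, and the role labels are all assumptions made for illustration, and the rule-based post-processing mentioned in the abstract is not modeled here.

```python
from typing import Dict, Optional, Set, Tuple

Span = Tuple[int, int]            # inclusive token indices (start, end)
Alignment = Set[Tuple[int, int]]  # (source index, target index) pairs


def project_span(src_span: Span, alignment: Alignment) -> Optional[Span]:
    """Return the smallest target span covering every token aligned to src_span."""
    start, end = src_span
    targets = [t for (s, t) in alignment if start <= s <= end]
    if not targets:
        return None  # nothing aligned: the annotation cannot be projected
    return (min(targets), max(targets))


def project_annotation(roles: Dict[str, Span], alignment: Alignment) -> Dict[str, Span]:
    """Project each labeled source span; labels with no aligned tokens are dropped."""
    projected = {}
    for label, span in roles.items():
        target_span = project_span(span, alignment)
        if target_span is not None:
            projected[label] = target_span
    return projected


# Toy English-Italian pair with a hand-written (hypothetical) word alignment.
english = ["The", "committee", "approved", "the", "proposal"]
italian = ["La", "commissione", "ha", "approvato", "la", "proposta"]
alignment = {(0, 0), (1, 1), (2, 3), (3, 4), (4, 5)}

# Hypothetical annotation on the English side: a predicate and two roles.
roles = {"predicate": (2, 2), "Agent": (0, 1), "Theme": (3, 4)}

projected = project_annotation(roles, alignment)
for label, (start, end) in projected.items():
    print(label, italian[start:end + 1])
```

In this toy case the auxiliary "ha" is left unaligned and therefore falls outside the projected predicate span, which is the kind of gap a post-processing rule could be designed to handle.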

Original language: English
Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages: 332-345
Number of pages: 14
Volume: 5449 LNCS
DOIs: https://doi.org/10.1007/978-3-642-00382-0_27
Publication status: Published - 21 Jul 2009
Externally published: Yes
Event: 10th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2009 - Mexico City, Mexico
Duration: 1 Mar 2009 to 7 Mar 2009

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 5449 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 10th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2009
Country: Mexico
City: Mexico City
Period: 1/3/09 to 7/3/09

Fingerprint

Semantics
Statistical Machine Translation
Robust Methods
Post-processing
Labeling
Preprocessing
Annotation
Alignment
Resources
Target
Optimization
Dependent
Processing
Corpus
Language
Model
Training
Strategy
Syntax

ASJC Scopus subject areas

  • Computer Science (all)
  • Theoretical Computer Science

Cite this

Basili, R., De Cao, D., Croce, D., Coppola, B., & Moschitti, A. (2009). Cross-Language frame semantics transfer in bilingual corpora. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5449 LNCS, pp. 332-345). https://doi.org/10.1007/978-3-642-00382-0_27

@inproceedings{ba6f0e62bb2b4b5a95e9b744251314af,
title = "Cross-Language frame semantics transfer in bilingual corpora",
abstract = "Recent work on the transfer of semantic information across languages has been recently applied to the development of resources annotated with Frame information for different non-English European languages. These works are based on the assumption that parallel corpora annotated for English can be used to transfer the semantic information to the other target languages. In this paper, a robust method based on a statistical machine translation step augmented with simple rule-based post-processing is presented. It alleviates problems related to preprocessing errors and the complex optimization required by syntax-dependent models of the cross-lingual mapping. Different alignment strategies are here investigated against the Europarl corpus. Results suggest that the quality of the derived annotations is surprisingly good and well suited for training semantic role labeling systems.",
author = "Roberto Basili and {De Cao}, Diego and Danilo Croce and Bonaventura Coppola and Alessandro Moschitti",
year = "2009",
month = "7",
day = "21",
doi = "10.1007/978-3-642-00382-0_27",
language = "English",
isbn = "3642003818",
volume = "5449 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "332--345",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Cross-Language frame semantics transfer in bilingual corpora

AU - Basili, Roberto

AU - De Cao, Diego

AU - Croce, Danilo

AU - Coppola, Bonaventura

AU - Moschitti, Alessandro

PY - 2009/7/21

Y1 - 2009/7/21

N2 - Recent work on the transfer of semantic information across languages has been recently applied to the development of resources annotated with Frame information for different non-English European languages. These works are based on the assumption that parallel corpora annotated for English can be used to transfer the semantic information to the other target languages. In this paper, a robust method based on a statistical machine translation step augmented with simple rule-based post-processing is presented. It alleviates problems related to preprocessing errors and the complex optimization required by syntax-dependent models of the cross-lingual mapping. Different alignment strategies are here investigated against the Europarl corpus. Results suggest that the quality of the derived annotations is surprisingly good and well suited for training semantic role labeling systems.

UR - http://www.scopus.com/inward/record.url?scp=67650511219&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=67650511219&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-00382-0_27

DO - 10.1007/978-3-642-00382-0_27

M3 - Conference contribution

SN - 3642003818

SN - 9783642003813

VL - 5449 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 332

EP - 345

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -