A preliminary study on the robustness and generalization of role sets for semantic role labeling

Beñat Zapirain, Eneko Agirre, Lluis Marques

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Most Semantic Role Labeling (SRL) systems rely on available annotated corpora, being PropBank the most widely used corpus so far. Propbank role set is based on theory-neutral numbered arguments, which are linked to fine grained verb-dependant semantic roles through the verb framesets. Recently, thematic roles from the computational verb lexicon VerbNet have been suggested to be more adequate for generalization and portability of SRL systems, since they represent a compact set of verb-independent general roles widely used in linguistic theory. Such thematic roles could also put SRL systems closer to application needs. This paper presents a comparative study of the behavior of a state-of-the-art SRL system on both role role sets based on the SemEval-2007 English dataset, which comprises the 50 most frequent verbs in PropBank.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages219-230
Number of pages12
Volume4919 LNCS
DOIs
Publication statusPublished - 27 Aug 2008
Externally publishedYes
Event9th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2008 - Haifa, Israel
Duration: 17 Feb 200823 Feb 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4919 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other9th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2008
CountryIsrael
CityHaifa
Period17/2/0823/2/08

Fingerprint

Semantics
Labeling
Robustness
Number theory
Portability
Linguistics
Compact Set
Comparative Study
Generalization
Corpus

ASJC Scopus subject areas

  • Computer Science(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Theoretical Computer Science

Cite this

Zapirain, B., Agirre, E., & Marques, L. (2008). A preliminary study on the robustness and generalization of role sets for semantic role labeling. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4919 LNCS, pp. 219-230). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4919 LNCS). https://doi.org/10.1007/978-3-540-78135-6_19

A preliminary study on the robustness and generalization of role sets for semantic role labeling. / Zapirain, Beñat; Agirre, Eneko; Marques, Lluis.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4919 LNCS 2008. p. 219-230 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4919 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Zapirain, B, Agirre, E & Marques, L 2008, A preliminary study on the robustness and generalization of role sets for semantic role labeling. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 4919 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 4919 LNCS, pp. 219-230, 9th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2008, Haifa, Israel, 17/2/08. https://doi.org/10.1007/978-3-540-78135-6_19
Zapirain B, Agirre E, Marques L. A preliminary study on the robustness and generalization of role sets for semantic role labeling. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4919 LNCS. 2008. p. 219-230. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-540-78135-6_19
Zapirain, Beñat ; Agirre, Eneko ; Marques, Lluis. / A preliminary study on the robustness and generalization of role sets for semantic role labeling. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4919 LNCS 2008. pp. 219-230 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{5847bf15264f43c69bb142f65d3a2b87,
title = "A preliminary study on the robustness and generalization of role sets for semantic role labeling",
abstract = "Most Semantic Role Labeling (SRL) systems rely on available annotated corpora, being PropBank the most widely used corpus so far. Propbank role set is based on theory-neutral numbered arguments, which are linked to fine grained verb-dependant semantic roles through the verb framesets. Recently, thematic roles from the computational verb lexicon VerbNet have been suggested to be more adequate for generalization and portability of SRL systems, since they represent a compact set of verb-independent general roles widely used in linguistic theory. Such thematic roles could also put SRL systems closer to application needs. This paper presents a comparative study of the behavior of a state-of-the-art SRL system on both role role sets based on the SemEval-2007 English dataset, which comprises the 50 most frequent verbs in PropBank.",
author = "Be{\~n}at Zapirain and Eneko Agirre and Lluis Marques",
year = "2008",
month = "8",
day = "27",
doi = "10.1007/978-3-540-78135-6_19",
language = "English",
isbn = "354078134X",
volume = "4919 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "219--230",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - A preliminary study on the robustness and generalization of role sets for semantic role labeling

AU - Zapirain, Beñat

AU - Agirre, Eneko

AU - Marques, Lluis

PY - 2008/8/27

Y1 - 2008/8/27

N2 - Most Semantic Role Labeling (SRL) systems rely on available annotated corpora, being PropBank the most widely used corpus so far. Propbank role set is based on theory-neutral numbered arguments, which are linked to fine grained verb-dependant semantic roles through the verb framesets. Recently, thematic roles from the computational verb lexicon VerbNet have been suggested to be more adequate for generalization and portability of SRL systems, since they represent a compact set of verb-independent general roles widely used in linguistic theory. Such thematic roles could also put SRL systems closer to application needs. This paper presents a comparative study of the behavior of a state-of-the-art SRL system on both role role sets based on the SemEval-2007 English dataset, which comprises the 50 most frequent verbs in PropBank.

AB - Most Semantic Role Labeling (SRL) systems rely on available annotated corpora, being PropBank the most widely used corpus so far. Propbank role set is based on theory-neutral numbered arguments, which are linked to fine grained verb-dependant semantic roles through the verb framesets. Recently, thematic roles from the computational verb lexicon VerbNet have been suggested to be more adequate for generalization and portability of SRL systems, since they represent a compact set of verb-independent general roles widely used in linguistic theory. Such thematic roles could also put SRL systems closer to application needs. This paper presents a comparative study of the behavior of a state-of-the-art SRL system on both role role sets based on the SemEval-2007 English dataset, which comprises the 50 most frequent verbs in PropBank.

UR - http://www.scopus.com/inward/record.url?scp=49949086920&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=49949086920&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-78135-6_19

DO - 10.1007/978-3-540-78135-6_19

M3 - Conference contribution

SN - 354078134X

SN - 9783540781349

VL - 4919 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 219

EP - 230

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -