Tree kernels for semantic role labeling

Alessandro Moschitti, Daniele Pighin, Roberto Basili

Research output: Contribution to journalArticle

106 Citations (Scopus)


The availability of large scale data sets of manually annotated predicate-argument structures has recently favored the use of machine learning approaches to the design of automated semantic role labeling (SRL) systems. The main research in this area relates to the design choices for feature representation and for effective decompositions of the task in different learning models. Regarding the former choice, structural properties of full syntactic parses are largely employed as they represent ways to encode different principles suggested by the linking theory between syntax and semantics. The latter choice relates to several learning schemes over global views of the parses. For example, re-ranking stages operating over alternative predicate-argument sequences of the same sentence have shown to be very effective. In this article, we propose several kernel functions to model parse tree properties in kernelbased machines, for example, perceptrons or support vector machines. In particular, we define different kinds of tree kernels as general approaches to feature engineering in SRL. Moreover, we extensively experiment with such kernels to investigate their contribution to individual stages of an SRL architecture both in isolation and in combination with other traditional manually coded features. The results for boundary recognition, classification, and re-ranking stages provide systematic evidence about the significant impact of tree kernels on the overall accuracy, especially when the amount of training data is small. As a conclusive result, tree kernels allow for a general and easily portable feature engineering method which is applicable to a large family of natural language processing tasks.

Original languageEnglish
Pages (from-to)193-224
Number of pages32
JournalComputational Linguistics
Issue number2
Publication statusPublished - 1 Jun 2008
Externally publishedYes


ASJC Scopus subject areas

  • Computer Science Applications
  • Computational Theory and Mathematics
  • Linguistics and Language
  • Language and Linguistics

Cite this