Noun compound interpretation using paraphrasing verbs: Feasibility study

Research output: Chapter in Book/Report/Conference proceedingConference contribution

24 Citations (Scopus)

Abstract

The paper addresses an important challenge for the automatic processing of English written text: understanding noun compounds' semantics. Following Downing (1977) [1], we define noun compounds as sequences of nouns acting as a single noun, e.g., bee honey, apple cake, stem cell, etc. In our view, they are best characterised by the set of all possible paraphrasing verbs that can connect the target nouns, with associated weights, e.g., malaria mosquito can be represented as follows: carry (23), spread (16), cause (12), transmit (9), etc. These verbs are directly usable as paraphrases, and using multiple of them simultaneously yields an appealing fine-grained semantic representation. In the present paper, we describe the process of constructing such representations for 250 noun-noun compounds previously proposed in the linguistic literature by Levi (1978) [2]. In particular, using human subjects recruited through Amazon Mechanical Turk Web Service, we create a valuable manually-annotated resource for noun compound interpretation, which we make publicly available with the hope to inspire further research in paraphrase-based noun compound interpretation. We further perform a number of experiments, including a comparison to automatically generated weight vectors, in order to assess the dataset quality and the feasibility of the idea of using paraphrasing verbs to characterise noun compounds' semantics; the results are quite promising.

Original languageEnglish
Title of host publicationArtificial Intelligence
Subtitle of host publicationMethodology, Systems, and Applications - 13th International Conference, AIMSA 2008, Proceedings
Pages103-117
Number of pages15
DOIs
Publication statusPublished - 25 Sep 2008
Event13th International Conference on Artificial Intelligence: Methodology, Systems, and Applications, AIMSA 2008 - Varna, Bulgaria
Duration: 4 Sep 20086 Sep 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5253 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other13th International Conference on Artificial Intelligence: Methodology, Systems, and Applications, AIMSA 2008
CountryBulgaria
CityVarna
Period4/9/086/9/08

    Fingerprint

Keywords

  • Lexical semantics
  • Noun compounds
  • Paraphrasing

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Nakov, P. (2008). Noun compound interpretation using paraphrasing verbs: Feasibility study. In Artificial Intelligence: Methodology, Systems, and Applications - 13th International Conference, AIMSA 2008, Proceedings (pp. 103-117). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5253 LNAI). https://doi.org/10.1007/978-3-540-85776-1_10