Automating ontological annotation with WordNet

Antonio Sanfilippo, Stephen Tratz, Michelle Gregory, Alan Chappell, Paul Whitney, Christian Posse, Patrick Paulson, Bob Baddeley, Ryan Hohimer, Amanda White

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of Word-Net to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.

Original languageEnglish
Title of host publicationGWC 2006: 3rd International Global WordNet Conference, Proceedings
PublisherMasaryk University
Pages85-93
Number of pages9
ISBN (Print)8021039159, 9788021039155
Publication statusPublished - 2005
Externally publishedYes
Event3rd International Global WordNet Conference, GWC 2006 - Jeju Island
Duration: 22 Jan 200626 Jan 2006

Other

Other3rd International Global WordNet Conference, GWC 2006
CityJeju Island
Period22/1/0626/1/06

Fingerprint

Annotation
Ontological
WordNet
Ontology
Disambiguation
Word Sense
Assignment
Inference
Lexical Database
Synonyms
Labor
Conceptual Representation
Semantic Web

ASJC Scopus subject areas

  • Language and Linguistics
  • Literature and Literary Theory

Cite this

Sanfilippo, A., Tratz, S., Gregory, M., Chappell, A., Whitney, P., Posse, C., ... White, A. (2005). Automating ontological annotation with WordNet. In GWC 2006: 3rd International Global WordNet Conference, Proceedings (pp. 85-93). Masaryk University.

Automating ontological annotation with WordNet. / Sanfilippo, Antonio; Tratz, Stephen; Gregory, Michelle; Chappell, Alan; Whitney, Paul; Posse, Christian; Paulson, Patrick; Baddeley, Bob; Hohimer, Ryan; White, Amanda.

GWC 2006: 3rd International Global WordNet Conference, Proceedings. Masaryk University, 2005. p. 85-93.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sanfilippo, A, Tratz, S, Gregory, M, Chappell, A, Whitney, P, Posse, C, Paulson, P, Baddeley, B, Hohimer, R & White, A 2005, Automating ontological annotation with WordNet. in GWC 2006: 3rd International Global WordNet Conference, Proceedings. Masaryk University, pp. 85-93, 3rd International Global WordNet Conference, GWC 2006, Jeju Island, 22/1/06.
Sanfilippo A, Tratz S, Gregory M, Chappell A, Whitney P, Posse C et al. Automating ontological annotation with WordNet. In GWC 2006: 3rd International Global WordNet Conference, Proceedings. Masaryk University. 2005. p. 85-93
Sanfilippo, Antonio ; Tratz, Stephen ; Gregory, Michelle ; Chappell, Alan ; Whitney, Paul ; Posse, Christian ; Paulson, Patrick ; Baddeley, Bob ; Hohimer, Ryan ; White, Amanda. / Automating ontological annotation with WordNet. GWC 2006: 3rd International Global WordNet Conference, Proceedings. Masaryk University, 2005. pp. 85-93
@inproceedings{23ce21e274a54015a276bd02c5e34cd9,
title = "Automating ontological annotation with WordNet",
abstract = "Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of Word-Net to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.",
author = "Antonio Sanfilippo and Stephen Tratz and Michelle Gregory and Alan Chappell and Paul Whitney and Christian Posse and Patrick Paulson and Bob Baddeley and Ryan Hohimer and Amanda White",
year = "2005",
language = "English",
isbn = "8021039159",
pages = "85--93",
booktitle = "GWC 2006: 3rd International Global WordNet Conference, Proceedings",
publisher = "Masaryk University",

}

TY - GEN

T1 - Automating ontological annotation with WordNet

AU - Sanfilippo, Antonio

AU - Tratz, Stephen

AU - Gregory, Michelle

AU - Chappell, Alan

AU - Whitney, Paul

AU - Posse, Christian

AU - Paulson, Patrick

AU - Baddeley, Bob

AU - Hohimer, Ryan

AU - White, Amanda

PY - 2005

Y1 - 2005

N2 - Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of Word-Net to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.

AB - Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of Word-Net to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.

UR - http://www.scopus.com/inward/record.url?scp=84904612042&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84904612042&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84904612042

SN - 8021039159

SN - 9788021039155

SP - 85

EP - 93

BT - GWC 2006: 3rd International Global WordNet Conference, Proceedings

PB - Masaryk University

ER -