Boosting applied to word sense disambiguation

Gerard Escudero, Lluis Marques, German Rigau

Research output: Chapter in Book/Report/Conference proceedingConference contribution

61 Citations (Scopus)

Abstract

In this paper Schapire and Singer’s AdaBoost.MH boosting algorithm is applied to the Word Sense Disambiguation (WSD)pro blem. Initial experiments on a set of 15 selected polysemous words show that the boosting approach surpasses Naive Bayes and Exemplar–based approaches, which represent state–of–the–art accuracy on supervised WSD. In order to make boosting practical for a real learning domain of thousands of words, several ways of accelerating the algorithm by reducing the feature space are studied. The best variant, which we call LazyBoosting, is tested on the largest sense–tagged corpus available containing 192,800 examples of the 191 most frequent and ambiguous English words. Again, boosting compares favourably to the other benchmark algorithms.

Original languageEnglish
Title of host publicationMachine Learning: ECML 2000 - 11th European Conference on Machine Learning, Proceedings
PublisherSpringer Verlag
Pages129-141
Number of pages13
Volume1810
ISBN (Print)9783540451648
Publication statusPublished - 2000
Externally publishedYes
Event11th European Conference on Machine Learning, ECML 2000 - Barcelona, Catalonia, Spain
Duration: 31 May 20002 Jun 2000

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume1810
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other11th European Conference on Machine Learning, ECML 2000
CountrySpain
CityBarcelona, Catalonia
Period31/5/002/6/00

    Fingerprint

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Escudero, G., Marques, L., & Rigau, G. (2000). Boosting applied to word sense disambiguation. In Machine Learning: ECML 2000 - 11th European Conference on Machine Learning, Proceedings (Vol. 1810, pp. 129-141). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 1810). Springer Verlag.