Wide-coverage spanish named entity extraction

Xavier Carreras, Lluis Marques, Lluís Padró

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

This paper presents a proposal for wide-coverage Named EntityExtraction for Spanish. The extraction of named entities is treated using robust Machine Learning techniques (AdaBoost) and simple attributes requiring non-linguisticallypro cessed corpora, complemented with external information sources (a list of trigger words and a gazetteer). A thorough evaluation of the task on real corpora is presented in order to validate the appropriateness of the approach. The non linguistic nature of used features makes the approach easilyp ortable to other languages.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages674-683
Number of pages10
Volume2527 LNAI
Publication statusPublished - 1 Dec 2002
Externally publishedYes
Event8th Ibero-American Conference on Artificial Intelligence, IBERAMIA 2002 - Seville, Spain
Duration: 12 Nov 200215 Nov 2002

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2527 LNAI
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other8th Ibero-American Conference on Artificial Intelligence, IBERAMIA 2002
CountrySpain
CitySeville
Period12/11/0215/11/02

    Fingerprint

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Carreras, X., Marques, L., & Padró, L. (2002). Wide-coverage spanish named entity extraction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2527 LNAI, pp. 674-683). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2527 LNAI).