SVMTool: A general POS tagger generator based on support vector machines

Jesús Giménez, Lluis Marques

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

200 Citations (Scopus)

Abstract

This paper presents the SVMTool, a simple, flexible, effective and efficient part-of-speech tagger based on Support Vector Machines. The SVMTool offers a fairly good balance among these properties, which makes it really practical for current NLP applications. It is very easy to use and easily configurable so as to perfectly fit the needs of a number of different applications. Results are also very competitive, achieving an accuracy of 97.16% for English on the Wall Street Journal corpus. It has also been successfully applied to Spanish, exhibiting similar performance. A first release of the SVMTool Perl prototype is now freely available for public use. A more efficient C++ version is coming very soon.

Original language: English
Title of host publication: Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004
Publisher: European Language Resources Association (ELRA)
Pages: 43-46
Number of pages: 4
ISBN (Electronic): 2951740816, 9782951740815
Publication status: Published - 1 Jan 2004
Event: 4th International Conference on Language Resources and Evaluation, LREC 2004 - Lisbon, Portugal
Duration: 26 May 2004 to 28 May 2004

Other

Other: 4th International Conference on Language Resources and Evaluation, LREC 2004
Country: Portugal
City: Lisbon
Period: 26/5/04 to 28/5/04

ASJC Scopus subject areas

  • Library and Information Sciences
  • Education
  • Language and Linguistics
  • Linguistics and Language

Cite this

Giménez, J., & Marques, L. (2004). SVMTool: A general POS tagger generator based on support vector machines. In Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004 (pp. 43-46). European Language Resources Association (ELRA).