Toward an efficient Arabic part of speech Tagger

Ahmed Abdelali, Yahya O Mohamed Elhadj, Rachid Bouziane

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The task of tagging and allotting the correct Part of Speech (POS) to text given its context is not obvious and requires expertise and use of considerable resources. Automating such task and building tools that can carry such job is crucial and imperative to advance in major areas of natural language processing. A limited numbers of Part of Speech Taggers exist currently for Arabic and their availability is not trivial. In this paper we present an effort to design and build a POS tagger that would take into consideration the richness of the language as well as the efficiency in processing volumes of text. The Light Arabic Part of Speech Tagger (LAPOST) current output is very comparable to existing system but more effective from the processing perspective.

Original languageEnglish
Title of host publicationProceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA
DOIs
Publication statusPublished - 13 Nov 2013
Event2013 IEEE and Arab Computing Society (ACS) International Conference on Computer Systems and Applications, AICCSA 2013 - Ifrane, Morocco
Duration: 27 May 201330 May 2013

Other

Other2013 IEEE and Arab Computing Society (ACS) International Conference on Computer Systems and Applications, AICCSA 2013
CountryMorocco
CityIfrane
Period27/5/1330/5/13

Fingerprint

Processing
Availability

Keywords

  • Arabic Language
  • Linguistic Features
  • Morphology
  • Natural Language Processing
  • Part of Speech
  • Syntax
  • Tagging

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Hardware and Architecture
  • Signal Processing
  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Cite this

Abdelali, A., Elhadj, Y. O. M., & Bouziane, R. (2013). Toward an efficient Arabic part of speech Tagger. In Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA [6616446] https://doi.org/10.1109/AICCSA.2013.6616446

Toward an efficient Arabic part of speech Tagger. / Abdelali, Ahmed; Elhadj, Yahya O Mohamed; Bouziane, Rachid.

Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA. 2013. 6616446.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abdelali, A, Elhadj, YOM & Bouziane, R 2013, Toward an efficient Arabic part of speech Tagger. in Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA., 6616446, 2013 IEEE and Arab Computing Society (ACS) International Conference on Computer Systems and Applications, AICCSA 2013, Ifrane, Morocco, 27/5/13. https://doi.org/10.1109/AICCSA.2013.6616446
Abdelali A, Elhadj YOM, Bouziane R. Toward an efficient Arabic part of speech Tagger. In Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA. 2013. 6616446 https://doi.org/10.1109/AICCSA.2013.6616446
Abdelali, Ahmed ; Elhadj, Yahya O Mohamed ; Bouziane, Rachid. / Toward an efficient Arabic part of speech Tagger. Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA. 2013.
@inproceedings{bc6ef460e9d5480e8b61decd1f6cf581,
title = "Toward an efficient Arabic part of speech Tagger",
abstract = "The task of tagging and allotting the correct Part of Speech (POS) to text given its context is not obvious and requires expertise and use of considerable resources. Automating such task and building tools that can carry such job is crucial and imperative to advance in major areas of natural language processing. A limited numbers of Part of Speech Taggers exist currently for Arabic and their availability is not trivial. In this paper we present an effort to design and build a POS tagger that would take into consideration the richness of the language as well as the efficiency in processing volumes of text. The Light Arabic Part of Speech Tagger (LAPOST) current output is very comparable to existing system but more effective from the processing perspective.",
keywords = "Arabic Language, Linguistic Features, Morphology, Natural Language Processing, Part of Speech, Syntax, Tagging",
author = "Ahmed Abdelali and Elhadj, {Yahya O Mohamed} and Rachid Bouziane",
year = "2013",
month = "11",
day = "13",
doi = "10.1109/AICCSA.2013.6616446",
language = "English",
isbn = "9781479907922",
booktitle = "Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA",

}

TY - GEN

T1 - Toward an efficient Arabic part of speech Tagger

AU - Abdelali, Ahmed

AU - Elhadj, Yahya O Mohamed

AU - Bouziane, Rachid

PY - 2013/11/13

Y1 - 2013/11/13

N2 - The task of tagging and allotting the correct Part of Speech (POS) to text given its context is not obvious and requires expertise and use of considerable resources. Automating such task and building tools that can carry such job is crucial and imperative to advance in major areas of natural language processing. A limited numbers of Part of Speech Taggers exist currently for Arabic and their availability is not trivial. In this paper we present an effort to design and build a POS tagger that would take into consideration the richness of the language as well as the efficiency in processing volumes of text. The Light Arabic Part of Speech Tagger (LAPOST) current output is very comparable to existing system but more effective from the processing perspective.

AB - The task of tagging and allotting the correct Part of Speech (POS) to text given its context is not obvious and requires expertise and use of considerable resources. Automating such task and building tools that can carry such job is crucial and imperative to advance in major areas of natural language processing. A limited numbers of Part of Speech Taggers exist currently for Arabic and their availability is not trivial. In this paper we present an effort to design and build a POS tagger that would take into consideration the richness of the language as well as the efficiency in processing volumes of text. The Light Arabic Part of Speech Tagger (LAPOST) current output is very comparable to existing system but more effective from the processing perspective.

KW - Arabic Language

KW - Linguistic Features

KW - Morphology

KW - Natural Language Processing

KW - Part of Speech

KW - Syntax

KW - Tagging

UR - http://www.scopus.com/inward/record.url?scp=84887218512&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84887218512&partnerID=8YFLogxK

U2 - 10.1109/AICCSA.2013.6616446

DO - 10.1109/AICCSA.2013.6616446

M3 - Conference contribution

SN - 9781479907922

BT - Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA

ER -