Arabic collocations extraction using gate

Soraya Zaidi, M. T. Laskri, Ahmed Abdelali

Research output: Chapter in Book/Report/Conference proceedingConference contribution

20 Citations (Scopus)

Abstract

Information extraction (IE) from corpora is texts analysis in order to extract structured information such as Named Entities (NE) which may be names of person, organization, address, date, location etc⋯. GATE is a software toolkit written in Java from 1995 and widely used worldwide by many communities (scientists, companies, teachers, students) for natural language processing. We have experimented Gate for extracting terms by writing new Jape rules (Java Annotation Pattern Engine) and used them on a tagged corpus developed at Leeds University. These terms will be used in the texts-based ontologies building. In our case this ontology will be incorporated into a search engine to expand queries on the Web, in the specified domain.

Original languageEnglish
Title of host publication2010 International Conference on Machine and Web Intelligence, ICMWI 2010 - Proceedings
Pages473-475
Number of pages3
DOIs
Publication statusPublished - 1 Dec 2010
Externally publishedYes
Event1st International Conference on Machine and Web Intelligence, ICMWI 2010 - Algiers, Algeria
Duration: 3 Oct 20105 Oct 2010

Other

Other1st International Conference on Machine and Web Intelligence, ICMWI 2010
CountryAlgeria
CityAlgiers
Period3/10/105/10/10

Fingerprint

Ontology
Search engines
World Wide Web
Students
Engines
Processing
Industry

Keywords

  • Collocation extraction
  • GATE
  • JAPE
  • NLP
  • Ontologies
  • Textual engineering

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Hardware and Architecture

Cite this

Zaidi, S., Laskri, M. T., & Abdelali, A. (2010). Arabic collocations extraction using gate. In 2010 International Conference on Machine and Web Intelligence, ICMWI 2010 - Proceedings (pp. 473-475). [5648038] https://doi.org/10.1109/ICMWI.2010.5648038

Arabic collocations extraction using gate. / Zaidi, Soraya; Laskri, M. T.; Abdelali, Ahmed.

2010 International Conference on Machine and Web Intelligence, ICMWI 2010 - Proceedings. 2010. p. 473-475 5648038.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Zaidi, S, Laskri, MT & Abdelali, A 2010, Arabic collocations extraction using gate. in 2010 International Conference on Machine and Web Intelligence, ICMWI 2010 - Proceedings., 5648038, pp. 473-475, 1st International Conference on Machine and Web Intelligence, ICMWI 2010, Algiers, Algeria, 3/10/10. https://doi.org/10.1109/ICMWI.2010.5648038
Zaidi S, Laskri MT, Abdelali A. Arabic collocations extraction using gate. In 2010 International Conference on Machine and Web Intelligence, ICMWI 2010 - Proceedings. 2010. p. 473-475. 5648038 https://doi.org/10.1109/ICMWI.2010.5648038
Zaidi, Soraya ; Laskri, M. T. ; Abdelali, Ahmed. / Arabic collocations extraction using gate. 2010 International Conference on Machine and Web Intelligence, ICMWI 2010 - Proceedings. 2010. pp. 473-475
@inproceedings{4dc9d0cade7241c59f09bc039e42c12b,
title = "Arabic collocations extraction using gate",
abstract = "Information extraction (IE) from corpora is texts analysis in order to extract structured information such as Named Entities (NE) which may be names of person, organization, address, date, location etc⋯. GATE is a software toolkit written in Java from 1995 and widely used worldwide by many communities (scientists, companies, teachers, students) for natural language processing. We have experimented Gate for extracting terms by writing new Jape rules (Java Annotation Pattern Engine) and used them on a tagged corpus developed at Leeds University. These terms will be used in the texts-based ontologies building. In our case this ontology will be incorporated into a search engine to expand queries on the Web, in the specified domain.",
keywords = "Collocation extraction, GATE, JAPE, NLP, Ontologies, Textual engineering",
author = "Soraya Zaidi and Laskri, {M. T.} and Ahmed Abdelali",
year = "2010",
month = "12",
day = "1",
doi = "10.1109/ICMWI.2010.5648038",
language = "English",
isbn = "9781424486113",
pages = "473--475",
booktitle = "2010 International Conference on Machine and Web Intelligence, ICMWI 2010 - Proceedings",

}

TY - GEN

T1 - Arabic collocations extraction using gate

AU - Zaidi, Soraya

AU - Laskri, M. T.

AU - Abdelali, Ahmed

PY - 2010/12/1

Y1 - 2010/12/1

N2 - Information extraction (IE) from corpora is texts analysis in order to extract structured information such as Named Entities (NE) which may be names of person, organization, address, date, location etc⋯. GATE is a software toolkit written in Java from 1995 and widely used worldwide by many communities (scientists, companies, teachers, students) for natural language processing. We have experimented Gate for extracting terms by writing new Jape rules (Java Annotation Pattern Engine) and used them on a tagged corpus developed at Leeds University. These terms will be used in the texts-based ontologies building. In our case this ontology will be incorporated into a search engine to expand queries on the Web, in the specified domain.

AB - Information extraction (IE) from corpora is texts analysis in order to extract structured information such as Named Entities (NE) which may be names of person, organization, address, date, location etc⋯. GATE is a software toolkit written in Java from 1995 and widely used worldwide by many communities (scientists, companies, teachers, students) for natural language processing. We have experimented Gate for extracting terms by writing new Jape rules (Java Annotation Pattern Engine) and used them on a tagged corpus developed at Leeds University. These terms will be used in the texts-based ontologies building. In our case this ontology will be incorporated into a search engine to expand queries on the Web, in the specified domain.

KW - Collocation extraction

KW - GATE

KW - JAPE

KW - NLP

KW - Ontologies

KW - Textual engineering

UR - http://www.scopus.com/inward/record.url?scp=78650636366&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78650636366&partnerID=8YFLogxK

U2 - 10.1109/ICMWI.2010.5648038

DO - 10.1109/ICMWI.2010.5648038

M3 - Conference contribution

SN - 9781424486113

SP - 473

EP - 475

BT - 2010 International Conference on Machine and Web Intelligence, ICMWI 2010 - Proceedings

ER -