A preprocessor for Turkish text analysis

Kemal Oflazer, Özlem Çetinoǧlu, Orhan Bilgin, Bilge Say

Research output: Contribution to journalArticle

Abstract

This paper describes a preprocessor for Turkish text that involves various stages of lexical, morphological and multi-word construct processor for preprocessing Turkish text for various language engineering applications. We present the architecture of the system with special emphasis on how various kinds of collocations and other similar multi-word constructs are handled and present an evaluation from a test corpus.

Original languageEnglish
Pages (from-to)761-770
Number of pages10
JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3280
Publication statusPublished - 2004
Externally publishedYes

Fingerprint

Text Analysis
Language
Engineering Application
Collocation
Preprocessing
Evaluation
Text
Architecture
Corpus

ASJC Scopus subject areas

  • Computer Science(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Theoretical Computer Science

Cite this

A preprocessor for Turkish text analysis. / Oflazer, Kemal; Çetinoǧlu, Özlem; Bilgin, Orhan; Say, Bilge.

In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 3280, 2004, p. 761-770.

Research output: Contribution to journalArticle

@article{f7a6f9095e2a4eafb337ce68da6225d1,
title = "A preprocessor for Turkish text analysis",
abstract = "This paper describes a preprocessor for Turkish text that involves various stages of lexical, morphological and multi-word construct processor for preprocessing Turkish text for various language engineering applications. We present the architecture of the system with special emphasis on how various kinds of collocations and other similar multi-word constructs are handled and present an evaluation from a test corpus.",
author = "Kemal Oflazer and {\"O}zlem {\cC}etinoǧlu and Orhan Bilgin and Bilge Say",
year = "2004",
language = "English",
volume = "3280",
pages = "761--770",
journal = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
issn = "0302-9743",
publisher = "Springer Verlag",

}

TY - JOUR

T1 - A preprocessor for Turkish text analysis

AU - Oflazer, Kemal

AU - Çetinoǧlu, Özlem

AU - Bilgin, Orhan

AU - Say, Bilge

PY - 2004

Y1 - 2004

N2 - This paper describes a preprocessor for Turkish text that involves various stages of lexical, morphological and multi-word construct processor for preprocessing Turkish text for various language engineering applications. We present the architecture of the system with special emphasis on how various kinds of collocations and other similar multi-word constructs are handled and present an evaluation from a test corpus.

AB - This paper describes a preprocessor for Turkish text that involves various stages of lexical, morphological and multi-word construct processor for preprocessing Turkish text for various language engineering applications. We present the architecture of the system with special emphasis on how various kinds of collocations and other similar multi-word constructs are handled and present an evaluation from a test corpus.

UR - http://www.scopus.com/inward/record.url?scp=35048841618&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35048841618&partnerID=8YFLogxK

M3 - Article

VL - 3280

SP - 761

EP - 770

JO - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

JF - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SN - 0302-9743

ER -