Two-level description of turkish morphology

Kemal Oflazer

Research output: Contribution to journalArticle

149 Citations (Scopus)

Abstract

This paper describes a full two-level morphological description of Turkish word structures. The description has been implemented using the PC-KIMMO environment and is based on a root word lexicon of about 23,000 root words. The phonetic rules of contemporary Turkish (spoken in Turkey) have been encoded using 22 two-level rules while the morphotactics of the agglutinative word structures have been encoded as finite-state machines for verbal, nominal paradigms and other categories. Almost all the special cases of, and exceptions to phonological and morphological rules have been taken into account. In this paper, we describe the rules and the finite state machines along with examples and a discussion of how various special cases were handled. We also describe some known limitations and problems with this description.

Original languageEnglish
Pages (from-to)137-148
Number of pages12
JournalLiterary and Linguistic Computing
Volume9
Issue number2
DOIs
Publication statusPublished - 1994
Externally publishedYes

Fingerprint

Phonetics
Finite automata
Turkey
Speech analysis
phonetics
PC
paradigm
Word Structure
Root Words
Paradigm
Lexicon
Morphological Rules
Phonological Rules

ASJC Scopus subject areas

  • Pharmacology
  • Neuroscience(all)
  • Immunology and Microbiology(all)
  • Pathology and Forensic Medicine
  • Safety, Risk, Reliability and Quality
  • Information Systems
  • Linguistics and Language

Cite this

Two-level description of turkish morphology. / Oflazer, Kemal.

In: Literary and Linguistic Computing, Vol. 9, No. 2, 1994, p. 137-148.

Research output: Contribution to journalArticle

Oflazer, Kemal. / Two-level description of turkish morphology. In: Literary and Linguistic Computing. 1994 ; Vol. 9, No. 2. pp. 137-148.
@article{56661a159e8e405b84f818762ed472cb,
title = "Two-level description of turkish morphology",
abstract = "This paper describes a full two-level morphological description of Turkish word structures. The description has been implemented using the PC-KIMMO environment and is based on a root word lexicon of about 23,000 root words. The phonetic rules of contemporary Turkish (spoken in Turkey) have been encoded using 22 two-level rules while the morphotactics of the agglutinative word structures have been encoded as finite-state machines for verbal, nominal paradigms and other categories. Almost all the special cases of, and exceptions to phonological and morphological rules have been taken into account. In this paper, we describe the rules and the finite state machines along with examples and a discussion of how various special cases were handled. We also describe some known limitations and problems with this description.",
author = "Kemal Oflazer",
year = "1994",
doi = "10.1093/llc/9.2.137",
language = "English",
volume = "9",
pages = "137--148",
journal = "Literary and Linguistics Computing",
issn = "0268-1145",
publisher = "Oxford University Press",
number = "2",

}

TY - JOUR

T1 - Two-level description of turkish morphology

AU - Oflazer, Kemal

PY - 1994

Y1 - 1994

N2 - This paper describes a full two-level morphological description of Turkish word structures. The description has been implemented using the PC-KIMMO environment and is based on a root word lexicon of about 23,000 root words. The phonetic rules of contemporary Turkish (spoken in Turkey) have been encoded using 22 two-level rules while the morphotactics of the agglutinative word structures have been encoded as finite-state machines for verbal, nominal paradigms and other categories. Almost all the special cases of, and exceptions to phonological and morphological rules have been taken into account. In this paper, we describe the rules and the finite state machines along with examples and a discussion of how various special cases were handled. We also describe some known limitations and problems with this description.

AB - This paper describes a full two-level morphological description of Turkish word structures. The description has been implemented using the PC-KIMMO environment and is based on a root word lexicon of about 23,000 root words. The phonetic rules of contemporary Turkish (spoken in Turkey) have been encoded using 22 two-level rules while the morphotactics of the agglutinative word structures have been encoded as finite-state machines for verbal, nominal paradigms and other categories. Almost all the special cases of, and exceptions to phonological and morphological rules have been taken into account. In this paper, we describe the rules and the finite state machines along with examples and a discussion of how various special cases were handled. We also describe some known limitations and problems with this description.

UR - http://www.scopus.com/inward/record.url?scp=0002230550&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0002230550&partnerID=8YFLogxK

U2 - 10.1093/llc/9.2.137

DO - 10.1093/llc/9.2.137

M3 - Article

VL - 9

SP - 137

EP - 148

JO - Literary and Linguistics Computing

JF - Literary and Linguistics Computing

SN - 0268-1145

IS - 2

ER -