Statistical dependency parsing of Turkish

Gülşen Eryiǧit, Kemal Oflazer

Research output: Chapter in Book/Report/Conference proceedingConference contribution

25 Citations (Scopus)

Abstract

This paper presents results from the first statistical dependency parser for Turkish. Turkish is a free-constituent order language with complex agglutinative inflectional and derivational morphology and presents interesting challenges for statistical parsing, as in general, dependency relations are between "portions" of words - called inflectional groups. We have explored statistical models that use different representational units for parsing. We have used the Turkish Dependency Treebank to train and test our parser but have limited this initial exploration to that subset of the treebank sentences with only left-to-right non-crossing dependency links. Our results indicate that the best accuracy in terms of the dependency relations between inflectional groups is obtained when we use inflectional groups as units in parsing, and when contexts around the dependent are employed.

Original languageEnglish
Title of host publicationEACL 2006 - 11th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference
Pages89-96
Number of pages8
Publication statusPublished - 2006
Externally publishedYes
Event11th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2006 - Trento
Duration: 3 Apr 20067 Apr 2006

Other

Other11th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2006
CityTrento
Period3/4/067/4/06

    Fingerprint

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Cite this

Eryiǧit, G., & Oflazer, K. (2006). Statistical dependency parsing of Turkish. In EACL 2006 - 11th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference (pp. 89-96)