Closure properties of Bulgarian clinical text

Irina P. Temnikova, Ivelina Nikolova, William A. Baumgartner, Galia Angelova, K. Bretonnel Cohen

Research output: Contribution to journalConference article

7 Citations (Scopus)

Abstract

Sublanguages are specialized genres of language associated with specific domains and document types. When sublanguages can be recognized and adequately characterized, they are useful for a variety of types of natural language processing applications. Although there are sublanguage studies related to languages other than English, all previous work on sublanguage recognition has focused on sublanguages related to general English. This paper tests whether a sublanguage detecting technique developed for English can be applied to another language. Bulgarian clinical documents are an excellent test case, because of a number of unique linguistic properties that affect their lexical and morphological characteristics. Bulgarian clinical documents were studied with respect to their closure properties and were found to fit the sublanguage model and exhibit characteristics like those noted for sublanguages related to English. It was also confirmed that the clinical sublanguage phenomenon is not a coincidental phenomenon of English, but applies to other languages as well. Implications of this fact for natural language processing are proposed.

Original languageEnglish
Pages (from-to)667-675
Number of pages9
JournalInternational Conference Recent Advances in Natural Language Processing, RANLP
Publication statusPublished - 23 Dec 2013
Event9th International Conference on Recent Advances in Natural Language Processing, RANLP 2013 - Hissar, Bulgaria
Duration: 9 Sep 201311 Sep 2013

ASJC Scopus subject areas

  • Software
  • Computer Science Applications
  • Artificial Intelligence
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Closure properties of Bulgarian clinical text'. Together they form a unique fingerprint.

  • Cite this