Closure properties of Bulgarian clinical text

Irina Temnikova, Ivelina Nikolova, William A. Baumgartner, Galia Angelova, K. Bretonnel Cohen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

Sublanguages are specialized genres of language associated with specific domains and document types. When sublanguages can be recognized and adequately characterized, they are useful for a variety of types of natural language processing applications. Although there are sublanguage studies related to languages other than English, all previous work on sublanguage recognition has focused on sublanguages related to general English. This paper tests whether a sublanguage detecting technique developed for English can be applied to another language. Bulgarian clinical documents are an excellent test case, because of a number of unique linguistic properties that affect their lexical and morphological characteristics. Bulgarian clinical documents were studied with respect to their closure properties and were found to fit the sublanguage model and exhibit characteristics like those noted for sublanguages related to English. It was also confirmed that the clinical sublanguage phenomenon is not a coincidental phenomenon of English, but applies to other languages as well. Implications of this fact for natural language processing are proposed.

Original languageEnglish
Title of host publicationInternational Conference Recent Advances in Natural Language Processing, RANLP
Pages667-675
Number of pages9
Publication statusPublished - 2013
Externally publishedYes
Event9th International Conference on Recent Advances in Natural Language Processing, RANLP 2013 - Hissar, Bulgaria
Duration: 9 Sep 201311 Sep 2013

Other

Other9th International Conference on Recent Advances in Natural Language Processing, RANLP 2013
CountryBulgaria
CityHissar
Period9/9/1311/9/13

Fingerprint

Processing
Linguistics

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Software
  • Electrical and Electronic Engineering

Cite this

Temnikova, I., Nikolova, I., Baumgartner, W. A., Angelova, G., & Cohen, K. B. (2013). Closure properties of Bulgarian clinical text. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 667-675)

Closure properties of Bulgarian clinical text. / Temnikova, Irina; Nikolova, Ivelina; Baumgartner, William A.; Angelova, Galia; Cohen, K. Bretonnel.

International Conference Recent Advances in Natural Language Processing, RANLP. 2013. p. 667-675.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Temnikova, I, Nikolova, I, Baumgartner, WA, Angelova, G & Cohen, KB 2013, Closure properties of Bulgarian clinical text. in International Conference Recent Advances in Natural Language Processing, RANLP. pp. 667-675, 9th International Conference on Recent Advances in Natural Language Processing, RANLP 2013, Hissar, Bulgaria, 9/9/13.
Temnikova I, Nikolova I, Baumgartner WA, Angelova G, Cohen KB. Closure properties of Bulgarian clinical text. In International Conference Recent Advances in Natural Language Processing, RANLP. 2013. p. 667-675
Temnikova, Irina ; Nikolova, Ivelina ; Baumgartner, William A. ; Angelova, Galia ; Cohen, K. Bretonnel. / Closure properties of Bulgarian clinical text. International Conference Recent Advances in Natural Language Processing, RANLP. 2013. pp. 667-675
@inproceedings{d60322f7b0b2467c95202b4c12c8d80a,
title = "Closure properties of Bulgarian clinical text",
abstract = "Sublanguages are specialized genres of language associated with specific domains and document types. When sublanguages can be recognized and adequately characterized, they are useful for a variety of types of natural language processing applications. Although there are sublanguage studies related to languages other than English, all previous work on sublanguage recognition has focused on sublanguages related to general English. This paper tests whether a sublanguage detecting technique developed for English can be applied to another language. Bulgarian clinical documents are an excellent test case, because of a number of unique linguistic properties that affect their lexical and morphological characteristics. Bulgarian clinical documents were studied with respect to their closure properties and were found to fit the sublanguage model and exhibit characteristics like those noted for sublanguages related to English. It was also confirmed that the clinical sublanguage phenomenon is not a coincidental phenomenon of English, but applies to other languages as well. Implications of this fact for natural language processing are proposed.",
author = "Irina Temnikova and Ivelina Nikolova and Baumgartner, {William A.} and Galia Angelova and Cohen, {K. Bretonnel}",
year = "2013",
language = "English",
pages = "667--675",
booktitle = "International Conference Recent Advances in Natural Language Processing, RANLP",

}

TY - GEN

T1 - Closure properties of Bulgarian clinical text

AU - Temnikova, Irina

AU - Nikolova, Ivelina

AU - Baumgartner, William A.

AU - Angelova, Galia

AU - Cohen, K. Bretonnel

PY - 2013

Y1 - 2013

N2 - Sublanguages are specialized genres of language associated with specific domains and document types. When sublanguages can be recognized and adequately characterized, they are useful for a variety of types of natural language processing applications. Although there are sublanguage studies related to languages other than English, all previous work on sublanguage recognition has focused on sublanguages related to general English. This paper tests whether a sublanguage detecting technique developed for English can be applied to another language. Bulgarian clinical documents are an excellent test case, because of a number of unique linguistic properties that affect their lexical and morphological characteristics. Bulgarian clinical documents were studied with respect to their closure properties and were found to fit the sublanguage model and exhibit characteristics like those noted for sublanguages related to English. It was also confirmed that the clinical sublanguage phenomenon is not a coincidental phenomenon of English, but applies to other languages as well. Implications of this fact for natural language processing are proposed.

AB - Sublanguages are specialized genres of language associated with specific domains and document types. When sublanguages can be recognized and adequately characterized, they are useful for a variety of types of natural language processing applications. Although there are sublanguage studies related to languages other than English, all previous work on sublanguage recognition has focused on sublanguages related to general English. This paper tests whether a sublanguage detecting technique developed for English can be applied to another language. Bulgarian clinical documents are an excellent test case, because of a number of unique linguistic properties that affect their lexical and morphological characteristics. Bulgarian clinical documents were studied with respect to their closure properties and were found to fit the sublanguage model and exhibit characteristics like those noted for sublanguages related to English. It was also confirmed that the clinical sublanguage phenomenon is not a coincidental phenomenon of English, but applies to other languages as well. Implications of this fact for natural language processing are proposed.

UR - http://www.scopus.com/inward/record.url?scp=84890499568&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84890499568&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84890499568

SP - 667

EP - 675

BT - International Conference Recent Advances in Natural Language Processing, RANLP

ER -