Classification of lexical stress patterns using deep neural network architecture

Mostafa Ali Shahin, Beena Ahmed, Kirrie J. Ballard

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Citations (Scopus)

Abstract

Lexical stress is a key diagnostic marker of disordered speech because it strongly affects speech perception. In this paper we introduce an automated method for classifying lexical stress patterns in children's speech. A deep neural network distinguishes strong-weak (SW), weak-strong (WS) and equal-stress (SS/WW) patterns in English by measuring the articulation change between two successive syllables. The network is trained on a set of acoustic features derived from pitch, duration and intensity measurements, along with the energies in different frequency bands. We compared the deep neural classifier with a traditional single-hidden-layer MLP, and the deep classifier outperformed it. The deep neural system achieves approximately 85% accuracy when classifying the unequal stress patterns (SW/WS) and more than 70% when classifying both equal and unequal stress patterns.
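The pipeline described in the abstract can be pictured with a minimal sketch. It is not the authors' implementation: the `Syllable` fields, the log-ratio and difference features, the layer sizes, and the example measurements are all assumptions made for illustration, and the weights are random and untrained, so the printed probabilities carry no meaning. The sketch only shows how measurements from two successive syllables might be turned into one "change" vector and passed through either a single-hidden-layer or a deeper feed-forward network with three output classes.

```python
import math
import random
from dataclasses import dataclass


@dataclass
class Syllable:
    # Hypothetical per-syllable measurements; the abstract does not list the exact features.
    mean_pitch_hz: float
    duration_s: float
    intensity_db: float
    band_energies: list  # one energy value per frequency band (units assumed)


def pair_features(first, second):
    """Describe the change from the first syllable to the second as a flat vector."""
    feats = [
        math.log(second.mean_pitch_hz / first.mean_pitch_hz),  # pitch change
        math.log(second.duration_s / first.duration_s),        # duration change
        second.intensity_db - first.intensity_db,              # intensity change
    ]
    feats += [b2 - b1 for b1, b2 in zip(first.band_energies, second.band_energies)]
    return feats


def init_layers(sizes, seed=0):
    """Random, untrained weights: one (weight matrix, bias vector) pair per layer."""
    rng = random.Random(seed)
    return [
        ([[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)],
         [0.0] * n_out)
        for n_in, n_out in zip(sizes, sizes[1:])
    ]


def forward(layers, x):
    """Feed-forward pass: tanh on hidden layers, softmax on the output layer."""
    for i, (weights, biases) in enumerate(layers):
        z = [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]
        x = z if i == len(layers) - 1 else [math.tanh(v) for v in z]
    peak = max(x)
    exps = [math.exp(v - peak) for v in x]
    total = sum(exps)
    return [e / total for e in exps]


LABELS = ["SW", "WS", "SS/WW"]

# Invented measurements for a strong syllable followed by a weak one.
strong = Syllable(220.0, 0.30, 70.0, [1.0, 0.8, 0.5])
weak = Syllable(180.0, 0.15, 62.0, [0.6, 0.4, 0.2])
x = pair_features(strong, weak)

shallow = init_layers([len(x), 8, len(LABELS)])      # single hidden layer
deep = init_layers([len(x), 8, 8, 8, len(LABELS)])   # several hidden layers
probs = forward(deep, x)
print(dict(zip(LABELS, probs)))
```

In this sketch the two architectures differ only in the list of layer sizes passed to `init_layers`. Training, the actual feature set, and the evaluation on children's speech are outside what the abstract specifies.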

Original language: English
Title of host publication: 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 478-482
Number of pages: 5
ISBN (Electronic): 9781479971299
DOIs: https://doi.org/10.1109/SLT.2014.7078621
Publication status: Published - 1 Apr 2014
Event: 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - South Lake Tahoe, United States
Duration: 7 Dec 2014 to 10 Dec 2014

Publication series

Name: 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings

Other

Other: 2014 IEEE Workshop on Spoken Language Technology, SLT 2014
Country: United States
City: South Lake Tahoe
Period: 7/12/14 to 10/12/14

Keywords

  • Automatic assessment
  • Deep neural network
  • Lexical stress
  • Prosody

ASJC Scopus subject areas

  • Computer Science Applications
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence
  • Language and Linguistics


Cite this

Shahin, M. A., Ahmed, B., & Ballard, K. J. (2014). Classification of lexical stress patterns using deep neural network architecture. In 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings (pp. 478-482). [7078621] (2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/SLT.2014.7078621