Semantic annotations for conversational speech

From speech transcriptions to predicate argument structures

Arianna Bisazza, Marco Dinarelli, Silvia Quarteroni, Sara Tonelli, Alessandro Moschitti, Giuseppe Riccardi

Research output: Chapter in Book/Report/Conference proceeding - Conference contribution

2 Citations (Scopus)

Abstract

In this paper, we describe the semantic content, which can be automatically generated, for the design of advanced dialog systems. Since the latter will be based on machine learning approaches, we created training data by annotating a corpus with the needed content. Given a sentence of our transcribed corpus, domain concepts and other linguistic levels ranging from basic ones, i.e. part-of-speech tagging and constituent chunking level, to more advanced ones, i.e. syntactic and predicate argument structure (PAS) levels, are annotated. In particular, the proposed PAS and taxonomy of dialog acts appear to be promising for the design of more complex dialog systems. Statistics about our semantic annotation are reported.

Original language: English
Title of host publication: 2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings
Pages: 65-68
Number of pages: 4
DOI: https://doi.org/10.1109/SLT.2008.4777841
Publication status: Published - 1 Dec 2008
Externally published: Yes
Event: 2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Goa, India
Duration: 15 Dec 2008 to 19 Dec 2008

Other

Other: 2008 IEEE Workshop on Spoken Language Technology, SLT 2008
Country: India
City: Goa
Period: 15/12/08 to 19/12/08

Fingerprint

Transcription
communication technology
Semantics
semantics
Taxonomies
Syntactics
Linguistics
taxonomy
Learning systems
Large scale systems
dialogue
statistics
Statistics
linguistics
learning
Dialogue Systems
Annotation
Argument Structure
Taxonomy
Constituent

ASJC Scopus subject areas

  • Language and Linguistics
  • Software
  • Electrical and Electronic Engineering
  • Communication

Cite this

Bisazza, A., Dinarelli, M., Quarteroni, S., Tonelli, S., Moschitti, A., & Riccardi, G. (2008). Semantic annotations for conversational speech: From speech transcriptions to predicate argument structures. In 2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings (pp. 65-68). [4777841] https://doi.org/10.1109/SLT.2008.4777841

Semantic annotations for conversational speech: From speech transcriptions to predicate argument structures. / Bisazza, Arianna; Dinarelli, Marco; Quarteroni, Silvia; Tonelli, Sara; Moschitti, Alessandro; Riccardi, Giuseppe.

2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings. 2008. p. 65-68 4777841.

Bisazza, A, Dinarelli, M, Quarteroni, S, Tonelli, S, Moschitti, A & Riccardi, G 2008, Semantic annotations for conversational speech: From speech transcriptions to predicate argument structures. in 2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings., 4777841, pp. 65-68, 2008 IEEE Workshop on Spoken Language Technology, SLT 2008, Goa, India, 15/12/08. https://doi.org/10.1109/SLT.2008.4777841
Bisazza A, Dinarelli M, Quarteroni S, Tonelli S, Moschitti A, Riccardi G. Semantic annotations for conversational speech: From speech transcriptions to predicate argument structures. In 2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings. 2008. p. 65-68. 4777841 https://doi.org/10.1109/SLT.2008.4777841
Bisazza, Arianna; Dinarelli, Marco; Quarteroni, Silvia; Tonelli, Sara; Moschitti, Alessandro; Riccardi, Giuseppe. / Semantic annotations for conversational speech: From speech transcriptions to predicate argument structures. 2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings. 2008. pp. 65-68
@inproceedings{1457c4adbd6f448580481e65d2702ffe,
title = "Semantic annotations for conversational speech: From speech transcriptions to predicate argument structures",
abstract = "In this paper, we describe the semantic content, which can be automatically generated, for the design of advanced dialog systems. Since the latter will be based on machine learning approaches, we created training data by annotating a corpus with the needed content. Given a sentence of our transcribed corpus, domain concepts and other linguistic levels ranging from basic ones, i.e. part-of-speech tagging and constituent chunking level, to more advanced ones, i.e. syntactic and predicate argument structure (PAS) levels, are annotated. In particular, the proposed PAS and taxonomy of dialog acts appear to be promising for the design of more complex dialog systems. Statistics about our semantic annotation are reported.",
author = "Arianna Bisazza and Marco Dinarelli and Silvia Quarteroni and Sara Tonelli and Alessandro Moschitti and Giuseppe Riccardi",
year = "2008",
month = "12",
day = "1",
doi = "10.1109/SLT.2008.4777841",
language = "English",
isbn = "9781424434725",
pages = "65--68",
booktitle = "2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings",

}

TY - GEN

T1 - Semantic annotations for conversational speech

T2 - From speech transcriptions to predicate argument structures

AU - Bisazza, Arianna

AU - Dinarelli, Marco

AU - Quarteroni, Silvia

AU - Tonelli, Sara

AU - Moschitti, Alessandro

AU - Riccardi, Giuseppe

PY - 2008/12/1

Y1 - 2008/12/1

AB - In this paper, we describe the semantic content, which can be automatically generated, for the design of advanced dialog systems. Since the latter will be based on machine learning approaches, we created training data by annotating a corpus with the needed content. Given a sentence of our transcribed corpus, domain concepts and other linguistic levels ranging from basic ones, i.e. part-of-speech tagging and constituent chunking level, to more advanced ones, i.e. syntactic and predicate argument structure (PAS) levels, are annotated. In particular, the proposed PAS and taxonomy of dialog acts appear to be promising for the design of more complex dialog systems. Statistics about our semantic annotation are reported.

UR - http://www.scopus.com/inward/record.url?scp=67649528019&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=67649528019&partnerID=8YFLogxK

U2 - 10.1109/SLT.2008.4777841

DO - 10.1109/SLT.2008.4777841

M3 - Conference contribution

SN - 9781424434725

SP - 65

EP - 68

BT - 2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings

ER -