Semantic mapping between natural language questions and SQL queries via syntactic pairing

Alessandra Giordani, Alessandro Moschitti

Research output: Chapter in Book/Report/Conference proceedingConference contribution

19 Citations (Scopus)

Abstract

Automatically mapping natural language semantics into programming languages has always been a major and interesting challenge in Computer Science. In this paper, we approach such problem by carrying out mapping at syntactic level and then applying machine learning algorithms to derive an automatic translator of natural language questions into their associated SQL queries. To build the required training and test sets, we designed an algorithm, which, given an initial corpus of questions and their answers, semi-automatically generates the set of possible incorrect and correct pairs. We encode such relational pairs in Support Vector Machines by means of kernel functions applied to the syntactic trees of questions and queries. The accurate results on automatic classification of the above pairs above, suggest that our approach captures the shared semantics between the two languages.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages207-221
Number of pages15
Volume5723 LNCS
DOIs
Publication statusPublished - 1 Dec 2009
Externally publishedYes
Event14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009 - Saarbrucken, Germany
Duration: 24 Jun 200926 Jun 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5723 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009
CountryGermany
CitySaarbrucken
Period24/6/0926/6/09

Fingerprint

Syntactics
Pairing
Natural Language
Semantics
Query
Computer programming languages
Computer science
Learning algorithms
Support vector machines
Learning systems
Test Set
Kernel Function
Programming Languages
Learning Algorithm
Support Vector Machine
Machine Learning
Computer Science
Syntax

Keywords

  • Kernel methods
  • Natural language processing
  • Support vector machines

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Giordani, A., & Moschitti, A. (2009). Semantic mapping between natural language questions and SQL queries via syntactic pairing. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5723 LNCS, pp. 207-221). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5723 LNCS). https://doi.org/10.1007/978-3-642-12550-8_17

Semantic mapping between natural language questions and SQL queries via syntactic pairing. / Giordani, Alessandra; Moschitti, Alessandro.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5723 LNCS 2009. p. 207-221 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5723 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Giordani, A & Moschitti, A 2009, Semantic mapping between natural language questions and SQL queries via syntactic pairing. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 5723 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5723 LNCS, pp. 207-221, 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Saarbrucken, Germany, 24/6/09. https://doi.org/10.1007/978-3-642-12550-8_17
Giordani A, Moschitti A. Semantic mapping between natural language questions and SQL queries via syntactic pairing. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5723 LNCS. 2009. p. 207-221. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-642-12550-8_17
Giordani, Alessandra ; Moschitti, Alessandro. / Semantic mapping between natural language questions and SQL queries via syntactic pairing. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5723 LNCS 2009. pp. 207-221 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{3a2f613748a54f539ceb62ae5001167a,
title = "Semantic mapping between natural language questions and SQL queries via syntactic pairing",
abstract = "Automatically mapping natural language semantics into programming languages has always been a major and interesting challenge in Computer Science. In this paper, we approach such problem by carrying out mapping at syntactic level and then applying machine learning algorithms to derive an automatic translator of natural language questions into their associated SQL queries. To build the required training and test sets, we designed an algorithm, which, given an initial corpus of questions and their answers, semi-automatically generates the set of possible incorrect and correct pairs. We encode such relational pairs in Support Vector Machines by means of kernel functions applied to the syntactic trees of questions and queries. The accurate results on automatic classification of the above pairs above, suggest that our approach captures the shared semantics between the two languages.",
keywords = "Kernel methods, Natural language processing, Support vector machines",
author = "Alessandra Giordani and Alessandro Moschitti",
year = "2009",
month = "12",
day = "1",
doi = "10.1007/978-3-642-12550-8_17",
language = "English",
isbn = "3642125492",
volume = "5723 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "207--221",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Semantic mapping between natural language questions and SQL queries via syntactic pairing

AU - Giordani, Alessandra

AU - Moschitti, Alessandro

PY - 2009/12/1

Y1 - 2009/12/1

N2 - Automatically mapping natural language semantics into programming languages has always been a major and interesting challenge in Computer Science. In this paper, we approach such problem by carrying out mapping at syntactic level and then applying machine learning algorithms to derive an automatic translator of natural language questions into their associated SQL queries. To build the required training and test sets, we designed an algorithm, which, given an initial corpus of questions and their answers, semi-automatically generates the set of possible incorrect and correct pairs. We encode such relational pairs in Support Vector Machines by means of kernel functions applied to the syntactic trees of questions and queries. The accurate results on automatic classification of the above pairs above, suggest that our approach captures the shared semantics between the two languages.

AB - Automatically mapping natural language semantics into programming languages has always been a major and interesting challenge in Computer Science. In this paper, we approach such problem by carrying out mapping at syntactic level and then applying machine learning algorithms to derive an automatic translator of natural language questions into their associated SQL queries. To build the required training and test sets, we designed an algorithm, which, given an initial corpus of questions and their answers, semi-automatically generates the set of possible incorrect and correct pairs. We encode such relational pairs in Support Vector Machines by means of kernel functions applied to the syntactic trees of questions and queries. The accurate results on automatic classification of the above pairs above, suggest that our approach captures the shared semantics between the two languages.

KW - Kernel methods

KW - Natural language processing

KW - Support vector machines

UR - http://www.scopus.com/inward/record.url?scp=78651229274&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78651229274&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-12550-8_17

DO - 10.1007/978-3-642-12550-8_17

M3 - Conference contribution

SN - 3642125492

SN - 9783642125492

VL - 5723 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 207

EP - 221

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -