The patents retrieval prototype in the MOLTO project

Milen Chechev, Meritxell Gonzàlez, Lluís Màrquez, Cristina España-Bonet

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

This paper describes the patents retrieval prototype developed within the MOLTO project. The prototype aims to provide a multilingual natural language interface for querying the content of patent documents. The developed system is focused on the biomedical and pharmaceutical domain and includes the translation of the patent claims and abstracts into English, French and German. Aiming at the best retrieval results of the patent information and text content, patent documents are preprocessed and semantically annotated. Then, the annotations are stored and indexed in an OWLIM semantic repository, which contains a patent specific ontology and others from different domains. The prototype, accessible online at http://molto-patents.ontotext.com, presents a multilingual natural language interface to query the retrieval system. In MOLTO, the multilingualism of the queries is addressed by means of the GF Tool, which provides an easy way to build and maintain controlled language grammars for interlingual translation in limited domains. The abstract representation obtained from the GF is used to retrieve both the matched RDF instances and the list of patents semantically related to the user's search criteria. The online interface allows to browse the retrieved patents and shows on the text the semantic annotations that explain the reason why any particular patent has matched the user's criteria. Copyright is held by the International World Wide Web Conference Committee (IW3C2).

Original languageEnglish
Title of host publicationWWW'12 - Proceedings of the 21st Annual Conference on World Wide Web Companion
Pages231-234
Number of pages4
DOIs
Publication statusPublished - 21 May 2012
Event21st Annual Conference on World Wide Web, WWW'12 - Lyon, France
Duration: 16 Apr 201220 Apr 2012

Publication series

NameWWW'12 - Proceedings of the 21st Annual Conference on World Wide Web Companion

Other

Other21st Annual Conference on World Wide Web, WWW'12
CountryFrance
CityLyon
Period16/4/1220/4/12

    Fingerprint

Keywords

  • Automatic semantic annotations
  • Multilingual information retrieval
  • Patent translation

ASJC Scopus subject areas

  • Computer Networks and Communications

Cite this

Chechev, M., Gonzàlez, M., Màrquez, L., & España-Bonet, C. (2012). The patents retrieval prototype in the MOLTO project. In WWW'12 - Proceedings of the 21st Annual Conference on World Wide Web Companion (pp. 231-234). (WWW'12 - Proceedings of the 21st Annual Conference on World Wide Web Companion). https://doi.org/10.1145/2187980.2188016